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►7- Abstract. Although models are built on the basis of some observations 

of reality, the concepts that derive theoretically from their definitions 
as well as from their characteristics and properties are not necessarily 
direct consequences of these initial observations. Indeed, many of them 
rather follow from chains of theoretical inferences that are only based 
»— H on the precise model definitions and rely strongly, in addition, on some 

<^ consequential working hypotheses. Thus, it is important to address the 

question of which features of a model effectively carry some modelling 
meaning and which only result from the task of formalising observations 
q of reality into a mathematical language. In this article, we address this 

1 1 question with a theoretical point view that sets our discussion strictly 

between the two stages of the modelling process that require knowledge 
of real systems, that is, between the initial stage that chooses a global 
theoretical framework to build the model and the final stage that ex- 
ploits its formal predictions by comparing them to the reality that the 
£^ model was designed to simulate. Taking Boolean automata networks as 

^vQ instances of models of systems observed in reality, we analyse in this set- 

ting the remaining stages of the modelling process and we show how the 
meaning of theoretical concepts can subtly rely on formal choices such 
as definitions and hypotheses. 

Keywords: Boolean automata network, update schedule, dynamical be- 
• • haviour, transition graph, modelling, synchronous and asynchronous tran- 

sitions. 
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1 Introduction 

The manipulation of mathematical objects requires the design and elaboration 
of precise formal definitions in relation to these objects and to their properties. 



Choosing these definitions, specifying consistent connections between them as 
well as, possibly, restricting and refining them a posteriori depends closely on 
the original purpose of the objects that are considered. Some theoretical ob- 
jects such as Boolean automata networks may be regarded and studied plainly 
as mathematical objects that are disconnected from any modelling considera- 
tions [1-4]. And they can also be considered as models of other systems, possibly 
more complex systems such as those encountered and observed in reality [5-8]. 
Of course, the approaches that follow from these two viewpoints are not in- 
dependent. On the one hand, formal studies may - and sometimes need to - 
be fed by practical interrogations that arise from more applicative contexts [9, 
10]. On the other hand, by definition, modelling aims at inferring properties of 
a real system by exploiting a knowledge of the properties of its mathematical 
model [11, 12]. Yet, with the first point of view, the mathematical objects are 
studied per se. The choices of formalisation and, more generally, the methodol- 
ogy that is adopted mainly aim at building or expanding a purely theoretical 
understanding in the domain at hand. In particular, possible restrictions to the 
scope of a study are usually brought for the simple sake of convenience in formal 
developments. On the contrary, the second point of view introduces motivations 
of a different nature. Indeed, with the ultimate aim of simulating or explaining a 
portion of reality, the choices of formalisation become oriented both by a prede- 
fined or intuitive interpretation of mathematical notions and by, conversely, the 
intention of representing given features of reality. As a consequence, the second 
point of view yields a notable difficulty of which the first purely formal strategy 
is immune. It requires to navigate safely but constructively between experiences 
of reality on one side, and mathematical abstractions of it, on the other. 

We propose to investigate this central difficulty of the modelling process with a 
theoretical stance. Thus, considering the second point of view mentioned above, 
we do not aim at deriving new mathematical results describing the properties 
of models. Conversely, we do not either address the question of choosing the 
formal framework to build models for a given category of real systems. Instead, 
we consider problems in which the general lines of the theory to be used for 
modelling are assumed to have been set a priori and accepted definitely. This 
way, without needing any thorough knowledge on the systems that are intended 
to be modelled, we can focus on the medial part of the modelling process, i.e., the 
part that consists in translating reality into abstraction and, dually, confronting 
abstraction to reality within a predefined formal framework. 

In this article, the framework that is considered is that of Boolean automata 
networks. It serves as a basis to the rest of our discussion. This results in three 
notable restrictions to the scope of our arguments. First, we consider only the 
modelling of real systems that are known or supposed to be networks of inter- 
acting elements. Second, we suppose that neither the occurrence of events nor 
the mechanisms that are responsible for them can be observed directly. Only the 
outcome of events is considered, i.e., only the states (or sequences of states) in 
which the network elements end up, as a result of unobserved events, are effec- 
tively observed. Thirdly, we assume that elements in the networks have two dual 
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(or possibly extreme) states that can be modelled by theoretical entities, namely 
Boolean automata, that can only take two different states, and 1. In most cases, 
this last restriction may appear as an excessive oversimplification of reality. If the 
elements in a network do take more than two states, then, it is probable that the 
whole range of their different states and the subtle nuances between them impact 
appreciably on the behaviour of other network elements and, a fortiori, on the 
global network behaviour. Consequently, the system may satisfy properties that 
are likely to elude a "Boolean modelling" which can only, by nature, focus on the 
roughest and the most obvious events, such as switches between two extreme 
states. For this reason, it may be argued that in some cases, a modelling that 
uses multi-state automata rather than Boolean automata is better suited [13-18]. 
However, we believe that in many cases, precisely because Boolean models can 
only hope to produce information on very basic, global properties of a system, 
their explanatory scope may be less ambiguous and more easy to exploit than 
that of more refined models that account for more subtle and complex properties 
by relying on a wider range of parameters. Thus, although the knowledge that 
Boolean modelling helps to develop is much more qualitative than quantitative 
and certainly very partial, we believe that it potentially provides reliable, well- 
bounded information or, at least, insights on existing causal relationships, that 
can serve as solid grounds for further and finer modellings. As mentioned above, 
however, our objective here is not to discuss the question of the pertinence of 
a general theoretical framework used to represent a part of reality. Thus, from 
now on, the Boolean framework is supposed to be accepted and Boolean au- 
tomata networks are taken to be effective possible satisfactory models of real 
systems such as, for instance, sets of genes in a cell, interacting via their protein 
products [19-22]. 

In the next section, Section 2, we list the main characteristics of these networks 
that are considered in the literature. This lays the grounds of what we call the 
"theory of Boolean automata networks". Then, in Section 3, we analyse the mod- 
elling of time and causality. This aims at illustrating some problems that arise 
within the "theorisation" step of the modelling process which consists in com- 
pleting the definition of the theory to be used for modelling and simultaneously 
specifying a correspondence between the modelled features of a category of real 
systems and the modelling features of the theory. The next section, Section 4, 
deals with the effective modelling of real systems. It considers the problem of 
observing the behaviour of a particular real system (belonging to the category 
of real systems for which the theorisation was intended) and deriving from the 
resulting, necessarily partial information an understanding of the causes of the 
events that are observed. Both Sections 3 and 4, in the respective contexts of 
theorisation and effective modelling, focus on hypotheses that follow naturally 
and inevitably from choices involved in the formalisation of reality and from the 
necessity to bypass the incompleteness of the information that is available con- 
cerning a certain part of reality. Finally, the last section of this article, Section 5, 
emphasises the importance of these hypotheses and addresses the more general 
issue of the ins and outs of modelling with Boolean automata networks. 
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2 Main features of Boolean automata networks 

Informally, an automata network of size n involves a set of n multi-state elements 
interacting with one another. The elements are called automata 4 [4,23-25]. In 
the general case, their set of possible states is any (finite) discrete set. In the 
present particular case of Boolean automata networks, they are supposed to take 
only two possible states, (inactive) and 1 (active). The interactions between 
automata of a network consist in influences of some automata states on other 
automata states. For the sake of simplicity, we abuse language and rather speak 
of influences between automata. When or under what circumstances do these 
influences effectively produce changes of automata states is a non trivial question 
that can be answered several ways according to the purpose of the study. The 
present section aims at completing this rough description and setting up the 
backbone of the "theory of Boolean automata networks" by enumerating the key 
features of these networks. 

2.1 States and configurations 

First of all, we introduce some conventions and notations. In the sequel, unless 
specified otherwise, the Boolean automata networks that are considered are sup- 
posed to have size neN and their automata are assumed to be numbered from 
to n — 1. The set V = {0, . . . , n — 1} refers to the set of network automata. 
As mentioned above, automata are supposed to have only two possible states. 
The binary set containing these two states is denoted by B = {0, 1}. Global 
states of networks, called configurations in the sequel, are vectors of the set B™. 
If x = (xo, . . . , x n -\) € B™ is the network configuration, then the i th component 
Xi € B of this vector is the state of the i th network automaton. In our context, 
focus is put especially on switches of automata states starting in a given net- 
work configuration. For this reason, the following notations concerning network 
configurations will be useful 5 : 

\/x = (ar ,...,a; n _i) € B™, 

(1) Vz e V, x 1 = {x , . . . ,Xi-i,-iXi,x i+ \, . . . ,x n -i), 

(2) VW = W W {i} CV,x w =¥ W = W 71 and, (!) 

(3) x — x v — (~>x , ■ ■ ■ , ->x n -i). 

2.2 Structure of a Boolean automata network 

To describe a Boolean automata network, it is often convenient to start by 
representing its underlying interaction structure by a digraph Q = (V, A). This 

4 As detailed later, the reason why the term automata is used here to refer to network 
elements is that these elements are supposed to compute output data, that is, their 
own state, from some given input data, that is, the states of other network automata, 
tt) denotes the disjoint reunion of sets (A = B a C o- [A = B U C and B n C = 0] ) 
and -i denotes the negation of a Boolean value (-i0 = 1 and — il = 0). 
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digraph is called the interaction graph of the network. Its set of nodes V is 
assimilated to the set of automata of the network and its set of arcs A represents 
the set of interactions that take place in it. More precisely, an arc G A of 
this digraph represents the influence that (the state of) automaton j E V may 
possibly have on (the state of) automaton i G V. Let us note that for the arc 
(j, i) to belong to A, node j is not supposed to have a constant effective impact 
on i. It is merely supposed to have an impact in some network configurations 
and in at least one of them (see Equation (2) below). In some works, a digraph 
Q{x) = {V,A{x)) is defined for every configuration x G B™ [17,21,26-28]. It 
contains arcs (j, i) G A(x) such that in x, j does indeed have an appreciable 
influence on i. This way, the set of arcs of the interaction graph Q equals A = 
U 6 b» A(x). 

Example 1 . The following figure represents an interaction graph Q = (V, A) 
where V = {0,1,2} and A = {(0, 1), (1, 1), (1, 2), (2, 1)}. The network whose 
structure it represents thus contains three interacting automata. Automaton 
is influenced by no automaton of the network which means that it always tends 
to take the same state. Automaton 1 is influenced by all three of the network 
automata, including itself. And automaton 2 is influenced just by automaton 1. 




2.3 Local transition functions 

The interaction graph of a network gives the existence of the oriented interactions 
that it involves. However, it does neither specify the nature of these interactions 
nor the conditions under which they effectively occur. This is done by assigning 
a local transition function : B n — »■ B to each automaton i G V so that the 
following is satisfied: 

3x e B™, fi(x) ? f l {x i ) (j, i) g A. (2) 

With this new definition, the digraphs G(x), x G B" mentioned above contain 
the following set of arcs: A(x) = {(j, i) \ fi(x) ^ /i(x 3 )}. 

Example 2. In agreement with the interaction structure of Example 1, automata 
0, 1 and 2 could, for instance, be assigned the following local transition functions, 
respectively: 

Vx G B™, f (x) - 1, h{x) = Xl y (x A -ix 2 ), / 2 (a0 = ^x\. * 

2.4 Events and updates 

In any network configuration, zero, one or several elementary or punctual events 
may take place. Here, we consider punctual events that consist in the update of 
one or several automata states. We call them respectively atomic and non-atomic 
updates and define them formally below. 
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Supposing that the network is currently in configuration x € B™, we say that 
automaton i G V is updated if its state switches from a;,, its current state, to 
its new state. Let us note that, possibly, fi(x) = xi so that the update 
of i is not effective in x. In any case, this local event yields a global network 
configuration change (possibly not effective) which is described by the i-update 
function :B"^B": 



Vx e B™, Fi(x) = (x , . ..Xi-i,fi(x),x i+ i, . . .,ar„_i). 



(3) 



This event is said to be atomic because it involves only one automaton. We 
also consider non-atomic events that correspond to the simultaneous update of 
several automata. In the general case, the W -update function 6 F\y : B" — > B™ 
describes the network configuration change that results from the update of all 
automata in an arbitrary set W C V: 



\fx G B", Vi G V, F w (x)i = 



fi(x) ifieW, 

Xi otherwise. 



(4) 



Example 3. The following table defines the update functions F\ and -F{o,2} for 
the network considered in Examples 1 and 2: 



X = (x ,Xi,X2) 


fo(x) 


h(x) 


h(x) 


Fi(x) 




(0,0,0) 


1 





1 


(0,0,0) 


(1,0,1) 


(0,0,1) 


1 





1 


(0,0,1) 


(1,0,1) 


(0,1,0) 


1 


1 





(0,1,0) 


(1,1,0) 


(0,1,1) 


1 


1 





(0,1,1) 


(1,1,0) 


(1,0,0) 


1 


1 


1 


(1,1,0) 


(1,0,1) 


(1,0,1) 


1 





1 


(1,0,1) 


(1,0,1) 


(1,1,0) 


1 


1 





(1,1,0) 


(1,1,0) 


(1,1,1) 


1 


1 





(1,1,1) 


(1,1,0) 



Let us emphasise that the punctuality of events mentioned above refers to their 
happening in a unique step whereas the atomicity of events characterises their 
nature. All atomic as well as all non-atomic events are punctual. No other punc- 
tual events are considered here but the next paragraph mentions more general 
events consisting in series of successive punctual events. 



2.5 Transitions and paths 

Network transitions are couples (x, y) G B" x B™ that represent changes of 
network configurations (from x to y) due to the occurrence of one or a series 
of punctual events. Transitions that involve only one punctual event are called 
elementary. They satisfy y = F w (x) for some (possibly empty) set W C V of 
automata and are denoted as follows: 



-> y, x 



w 



-> y or x 



-w- 



Vi G V, Fi obviously equals F^y but in this paper, we prefer the first notation. 
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There are two main types of elementary transitions. Asynchronous transitions 
correspond to atomic updates. Synchronous transitions correspond to non-atomic 
updates. When emphasis needs to be put on the asynchronicity (resp. synchronic- 
ity) of an elementary transition x — M — > y = Fi(x) (resp. x — w — > y = 
F\y{x), \W\ > 1) it is rather written: 

i ( W \ 

x > y, x > y or x — i — > y (resp. x ► y, x ► y or x — w — ► y) . 

General network transitions (x,y) eB" x B™ are sequences of zero, one or sev- 
eral elementary transitions. They are denoted using the reflexive and transitive 
closure » of > and defined formally by: 



x » y <=> 

IeN, 3a; 1 ,..., x 1 ' 1 el™, x > x 1 > ... > x^ 1 > y. (5) 

Any network transition x » y thus corresponds to an ordered list of sets 

{W t )i<t<i such that y = Fyy e o . . . o F\y 2 o Fw 1 (x). When this list is known, 
we use the following notation to specify the sequence of punctual updates in 
question: 

Wi,W2,...,w t 
x ■ — » y. 

Now, for the discussion that follows, it is important to note that because tran- 
sitions involve no other events than automata updates, some situations need to 
be disregarded. The most basic example is the following: 

x > y where 3i e V, Xi = fi(x) ^ y t . 



Thus, for the network of Examples 1 to 3, the elementary transition 

(0,0,0) > (1,1,0) 

is impossible although the network can, all the same, perform the non-elementary 
transition 

(0,0,0) » (1,1,0) 

by carrying out the sequence 

(0,0,0) > (1,0,0) > (1,1,0). 

Similarly, suppose that in configuration x, two allegedly elementary transitions, 
x — w — > y and x — W — > y' , are possible. Because any automaton i that 
is updated by both transitions necessarily takes state fi(x) in both resulting 
configurations y and y' , it must hold that Mi e W n W , yi — y[ — fi(x). 
Consequently, the following situation is also impossible: 

5, V 

w 

x where 3i € W H W, yi ^ y',. 

w ' , 
V 
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However, again, a similar situation might be possible if the transitions x » y 

and x » y' are not supposed to be elementary. Thus, as discussed further 

in Section 4, the nature of transitions observed is an essential precision in the 
observation of the behaviour of a network. To determine it, prior knowledge on 
the network local transition functions is required. 

Paths (usually called trajectories in the context of dynamical systems) are or- 
dered lists of network transitions (x°, x 1 ), (x 1 , x 2 ), . . . , {x l ~ x ,x l ), simply written 
as follows: 

n i 9 f— 1 f 
x » x » x » . . . » x » x . 

Since transitions are either elementary or non-elementary, the definition of an 
arbitrary path may involve steps corresponding to punctual events as well as 
steps corresponding to undetailed series of punctual events (see Example 4). 
Paths that involve only elementary transitions are said to be elementary. 

Example 4- The path bellow involves two elementary transitions, x° > x 1 = 

Fi{x) and x 2 > x 3 = F\y(x 2 ), as well as one non-elementary transition, 



x » x 2 , which could itself be broken into a path of several elementary tran- 
sitions if the updates it involves were known: 

i 1 2 W 3 w 
X U > x L » x 2 > X A . * 

2.6 Update schedules 

An update schedule 6 of a set V of automata (or, by extension, of a network 
whose set of automata is V), is defined by an ordered (finite or infinite) list 
{W t )tes (S C N) of non-empty sets of automata (Vi e S, ^ W t C V). We 
write S = (Wt)tes or just 8 = Wo, W±, . . . , Wt, ■ ■ ■■ Under an update schedule 
S = (W t )tes, starting in configuration x <G B™, a network takes sequentially the 
configurations x° — F Wo (x), x 1 — F Wl o F Wo (x), . . ., x l — F Wt o . . . o F Wo (x), 
. . ., i.e., it follows the elementary path: 



w Wi w t w t+1 
> F Wo (x) > ■ ■ ■ > F Wt o . . . o F Wo (x) > 



(6) 



In particular, S only allows the network to perform elementary transitions that 
update one of the sets Wt, t € S. If U C V is a set of automata that differs from 
all of these sets (U ^ Wt, 6 S), then x — u — > Fjj(x) is not an elementary 
transition that can be done under 5. In addition, the sets W t , t € S themselves 
cannot either be updated in any network configuration. To detail this, let us 
introduce by induction on t G S the sets X t — {x \ x — w t — > F\y t (x) is 
allowed by <5} C B": 



Vt G S, X t+ i = {F Wt {x) | x e X t } = F Wt {X t ). 



(7) 



Example 5 below illustrates that for any configuration x X t , by definition of 
X t , the elementary transition x — w t — > Fw t {x) is not possible according to 6. 
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Example 5. Suppose that the network considered in Examples 1 to 3 is updated 
by the periodic update schedule 8 = {1}, {0, 2}, {1}, {0, 2}, . . .. Then, since 8 does 
not allow the atomic update of automaton 2, any elementary transition of the 
form (0,a;i,a;2) — {2} — > (0, x\, -*xi) is impossible. The sets X t , tGN defined 
above can be shown to equal the following: X$ = B 3 , X\ = B 3 \ {(1, 0, 0)} and, 
Vi > 2, X t = {(1, 0, 1), (1, 1, 0)}. As a consequence, in configuration (1,0,0), 8 
does not either allow the set {0, 2} to be updated. 

As mentioned in Section 2.5, our choice of definitions imposes that certain net- 
work behaviours be baned. Adding the supplementary constraint of an update 
schedule restricts further the situations that may be considered possible. For in- 
stance, the following situation in which x belongs to both the sets X% and X§ (see 
Equation 7 above) is consistent with the update schedule 8 only if Vi G W3 n W$ 
it holds that y i = Zi = fi(x): 



x = F W2 (y 

= F Wi (z 4 ) 




F w& (x) 
F W3 (x) 



and generally, for any subsets Wt, Wt' C V that belong to the defining list of an 
update schedule 8, the following must hold: 

e B™, [y = F Wt (x) and z = F Wt , (x)] => [Vi € W t n W v , y t = z t = f t {x)] . 
Thus, again, the set of local transition functions of a network updated with a 
given update schedule needs to be known in order to determine the trajectories 
that are possible. 

Periodic update schedules of arbitrary period p <G N correspond to infinite pe- 
riodic lists Wo, Wi, . . . , W p _i, W , W\, . . . , W p -i, . . . (e.g., the update schedule 
of period 2 in Example 5). For the sake of simplicity they are rather defined by 
finite ordered lists (W^teN/pN of size p: 8 = Wo, W\, . . . , W p -\. We define global 
transition functions F[S] : B™ — > B™ relative to such update schedules: 



V.t g B™, F[S](x) = F Wp _ 1 o...oF Wl o F Wq (x). 



(8) 



The definition of this function allows to focus on series of p elementary transitions 
rather than on single elementary transitions so that Equation 6 can be simplified 
to the following (non necessarily elementary) path: 

x » F[8](x) » F[S] 2 (x) » . . . » F[S] k (x) » 

where F[8] k denotes the k th iterate of F[8]. 

Now, let w be the equivalence relation that relates periodic update schedules that 
differ only by a rotation of their sequence of updates. For instance, under this re- 
lation, the periodic update schedule defined by the list {1}, {0, 2}, {1, 2} is equiv- 
alent to the two periodic update schedules defined by the lists {0, 2}, {1, 2}, {1} 
and {1, 2}, {1}, {0, 2}. Consider two equivalent update schedules 5 = Wo, W\, . . . , 
VF p _i and 8' = Wq, W[, W' V _ Y satisfying Mt e N/pN, W{ = W t+A for some 
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A E N/pN. Then, let us note that any elementary path starting in x G B n 
under 8 becomes identical, at its A th step, to the elementary path that starts 
in F Wa i o . . . o F Wo (x) under 8'. Thus, except for A elementary transitions at 
the beginning of each path, 8 and 8' yield exactly the same network behaviours. 
Focusing on non-elementary transitions representing series of p elementary tran- 
sitions, however, the two update schedules yield very different paths that cannot 
generally be identified at all from the point of view of an outside observer that 
knows nothing about the update schedule that is used. 

Let us also note that periodic update schedules 8 may also be defined as functions 
8 : V — > V(N/pN)), where V(S) refers to the power set of a set S (see Example 6 
below). This way, for any automaton i e V, 8(i) is the set of updates involving 
node i in the periodic sequence (W t ) te m/pN sucn that: 

Vt G N/pN, i G W t «• t G <f(i). 

And since each subset W t , with i G N/pN, must be non-empty in order for the 
update schedule to effectively have period p, 8 must satisfy: 

VteN/pN, 3ieV, te8(i). 

A well-known instance of periodic update schedules are block-sequential update 
schedules [4, 23, 29-33]. Their particularity lies in that their sequence of updates 
involves exactly once each automaton of the network. Thus, they can be defined 
either by a finite list (W t ) teN / pN such that V = l+J teN / pN W t or, abusing notations 
introduced above, by a function 8 : V — > N/pN (see Example 6). The parallel 
update schedule is the unique block-sequential update schedule of period p = 1 
(Vi G V, S(i) = 1). It updates all nodes of the network in one step, simulta- 
neously. Sequential update schedules are block-sequential update schedules with 
period equal to the size of the network {p — n). They update only one node of 
the network at a time (Vt G N/pN, \Wt\ = 1). Let us mention here that identi- 
fying the block-sequential update schedules that are equivalent under s=s reduces 
the number of these update schedules 7 by a factor that tends towards 2 "'" 2 . 

A larger class of periodic update schedules that contains all block-sequential 
update schedules is the class of simple update schedules (mentioned later in Sec- 
tion 2. 10 for their particularity) . This class contains all periodic update schedules 
that do not update any automata more than once within each period. With the 
functional notation introduced earlier on Page 10, simple update schedules are 
defined as the periodic update schedules satisfying: 

Vi G V, \8{i)\ < 1. (9) 

7 Proof. Let S(n, k) for k < n count the number of surjective applications from a set 
of n elements to a set of k. The number of block-sequential update schedules of a set 
of n automata equals bs„ = J] < fe<n (™)-BS fc = Ei< fc <„ S(n, k) ~ \ ■ (fw ^j„ +1 [34] 
(sequnce A670 in [35]). The number of equivalence classes for the relation w can be 
shown to equal Ss n = J2i< k <n ^r^. Thus, from S(n+1, k) = k(S(n, k)+S(n,k-l), 
results that BS n+ i = 2 x BS n and BS n ~ 2 ' 1 ™ 2 x BS n . 
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Finally, let us introduce another generalisation of block-sequential update sched- 
ules. Fair update schedules [36, 37] are the periodic update schedules that update 
each automaton at least once (see Example 6). Unlike block-sequential update 
schedules, they may update some automata more often than others. A fc-fair 
update schedule of period p is an update schedule S : V — > V(N/pN)) such that 
for all automata i and j of the network, the following holds: 

\S(i)\<k-\S(j)\, 

i.e., within each period, i is not updated more than k times as much as j is. 
Block-sequential update schedules are a special type of 1-fair update schedule. 

Example 6. Consider a network of size n = 6. The 3-fair update schedule S = 
{2,5}{0, 1,4}{1,2,3}{0, 1,4,5}, the block-sequential update schedule (3 = {2} 
{3, 4}{0, 1, 5}, the sequential update schedule a = {5}{3}{1}{0}{2}{4} and the 
parallel update schedule tt = {0, 1, 2, 3, 4, 5} can be defined as functions: 



S : < 



V -> P(N/4N) 
{1,3} 
{1,2,3} 
{0,2} 

{2} 
{1,3} 
{0,3} 



i y 

1 i y 

2 i y 

3 i y 

4 i y 

5 i y 



V -> N/3N 
ie {0,1,5} ^ 2 
2 i y 
i £z } 3 , 4} i y 1 



a : < 



( V -> N/6N 
04 3 

1 ^ 

2 4 

3 ^ 

4 4 

5 4 



N/1N 




2.7 Automata stability and transition effectiveness 

As mentioned above, when an automaton is updated in a given configuration 
x G B™, it does not necessarily change states. We define the set U{x) of automata 
that can indeed change states in x and that do if and only if they are updated: 

U{x) = {ieV \ fi(x) ? Xi }. 

The automata in U(x) are said to be unstable in x and those in U{x) = V\U{x) 
are said to be stable in x. It is important to note that the only couples (x, y) G 
B™ x B™ that indeed are elementary network transitions are the couples that 
satisfy: 

D(x,y) = {ieV | Xi^yi} C U{x). 
It also is important to note that for any set W C V, the following holds: 

VxeB", F w {x) = F wnu{x) (x) = x wnu ^. 
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As a consequence, the elementary transitions 



w 



■» F w (x) and x 



wnu(x) 



.7; 



perform different updates but produce identical effects. Elementary transitions 
x — w — > y such that W D U (x) — and x = y are called null transitions. 
Elementary transitions x — w — > y such that W C U(x) and y = x w are 
called effective transitions. Other elementary transitions are said to be partially 
null or partially effective. 

2.8 State transition systems and transition graphs 

A transition graph of a network is a digraph T = (X, T) that represents its 
global behaviour. Its nodes are network configurations and its arcs are network 
transitions. There are several important characteristics that a transition graph 
may have. A first characteristic concerns the nature of the transitions contained 
in the digraph. If all arcs are elementary transitions, then the transition graph 
it is said to be elementary. 

Among the set of elementary transition graphs that may be associated to a 
network of size n is its general transition graph [38], GTG for short, denoted by 
7g = (B™, T G ). Its set of nodes is B™ and its set of arcs is the set of all elementary 
network transitions: 



Every node x G B™ in this multiple digraph has out-degree deg + (x) = 2™ — 
1. Another notable elementary transition graph is the asynchronous transition 
graph, or ATG for short, ft is is the spanning sub-graph 7a = (B™, T A ) of 7g whose 
set of transitions equals the set of asynchronous transitions of the network: 



and in which each node x G B" has out degree deg + (.x) = n (the size of the 
network) . Arbitrary transition graphs that contain only asynchronous transitions 
are also called asynchronous [14, 26, 27, 39-44]. Because updates are not always 
effective, generally, an elementary transition graph Tis a multiple digraph. Its 
simple version is called its effective version and is denoted by T s . In this digraph, 
all arcs [x, y), x 7^ y which are not loops are effective transitions, i.e., D(x, y) C 
U{x) (see Example 7 below). The effective version of the GTG is refered to as the 
GTG cff , and that of the ATG as the ATG cft . 

N on- elementary transition graphs which contain non-elementary transitions are 
often used to describe the behaviour of a network that is updated according to a 
certain periodic update schedule (see Example 7). To a periodic update schedule 
S = (W t )t£N/ P N, we associate the digraph T$ of F[S] (see Equation 8): 



Tg = \J{(x, F w {x)) I x G B", l^^CV}. 



T A = \J{(x,F t (x)) I x G B™, i G V}. 



Ts = (B n ,T tf ) where T s = {(x,F[5](x)) \ x G B™}. 



(10) 
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Let us highlight that a transition graph T = (X, T) does not necessarily have 
a node set X included in B". Indeed, when the network behaviour is context- 
dependent, i.e., when the transitions that are possible in a given configuration 
depend on the paths that lead to that configuration, in order to avoid loosing 
any information, it may be required to define X as a multi-set on B™. This is the 
case for elementary transition graphs that describe the behaviour of networks 
that are updated with periodic update schedules (see T S ELEM in Example 7): if 
configuration x belongs to k different sets X t (see Equation 7), then, because 
the network behaviour is deterministic, any elementary transition graph that 
represents exhaustively the network behaviour needs to contain k copies of x 
(consider merging nodes representing the same configuration in the digraph JJ LEM 
of Example 7). Thus, to a periodic update schedule S = (Wt)t£N/pN> we a ^ so 
associate the "elementary version" of Ts , namely the elementary graph T S ELEM 
which contains all elementary paths of the network updated with 5 . Its set of 
arcs equals set of transitions {(x,Fw t {xj) € X t x X t +i}. 

When, on the contrary, the network behaviour at one step is independent of 
its behaviour at all previous steps, we say that it is context-free (or memory- 
less). In this special case, the transition graph T = (X,T) defines exactly a 
state transition system [45-47], that is, a digraph whose set of nodes IC1" 
is the set of states of the system and whose set of arcs T C X x X is its set of 
transitions. Contrary to T S ELEM , the transition graphs Ts, To, Tg cS , 7a and Ta" s 
mentioned above as well as any of their sub-graphs all are examples of state 
transition systems. 



Example 7. Consider the network of Examples 1 to 3. Its GTG ofi is the digraph 
represented below where the set of arcs represented with thicker lines equals the 
set of arcs of the ATG ofi of the same network. 



{1,2} 

(0,0,1) —{o}- 
t 

{2} {0,2} 



{0,1,2} 

(1,0,1) 
* t 

{2} 



{0,1} 
(1,1.1) 



{o}— (0,1,1) 



{1,2} 



{0,2} 



{0} 



.{!}■ 



{0}> 



{2} 

(1,1,0) 

T \ 

{0,1,2} '{1,2} 

The transition graph Ts of the network of Examples 1 to 3 associated to the 
update schedule given in Example 5 is the following: 



(0,0,0) 



(1,0,0) 



{2} 

\ 

(0,1,0) 



(0,0,0) ^ nnn (0,1,0) 

(0,0,1) **W> L ) ( o,i,i) 

and the corresponding elementary transition graph Tg 



(1,1,0) 



(1,0,0) 

(1,1,1) 



is: 
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(0,0,0) > (0,0,0) (1,0,1) 

(0, 0, 1) > (0, 0, 1) > (1, 0, 1) 

(0,1,0) > (0,1,0) > (1,1,0) 

(0,1,1)- (0,1,1) - (1,1,0) 

{1} {0,2} 

(1,0,0) > (1,1,0) > (1,1,0) 

(1,0,1) > (1,0,1) > (1,0,1) 

(1.1.0) > (1,1,0) > (1,1,0) 

(1.1.1) > (1,1,1) > (1,1,0) 



2.9 Network behaviours 

If T = (X, T) is the transition graph representing the global behaviour of a 
given network, then, any sub-graph of T represents a particular behaviour of 
that network. For instance, if a network is supposed to be potentially able of 
performing any elementary transition, then, T either equals the GTG 7g of the 
network or its GTG ofT Tg" s and any elementary transition graph represents a 
possible particular behaviour of the network. 

Let us focus on state transition systems T= (B",T). For these, we define the 

binary relation T* C » as the reflexive and transitive closure of the relation 

T, that is, (x, y) £ T* if and only if there exists in Ta path from x to y. Transient 
configurations are then defined as the configurations i£B" that satisfy: 

3y£M n , (x,y) £ T* A (y,x) £ T* . 

Any behaviour of the network that involves transient configurations is said to be 
transient itself. Configurations that are not transient are called recurrent. These 
configurations are precisely those that induce the terminal strongly connected 
components of T, called limit behaviours here (and rather called attractors in the 
context of deterministic dynamical systems [48-50]). There are two main types 
of limit behaviours. Those that contain strictly more than one configuration are 
called (sustained) oscillations [51] (or limit cycles when Tdefines a deterministic 
dynamical system). Limit behaviours of size one are called stable configurations 
(or fixed points when Tdefines a deterministic dynamical system 8 ). They are 
characterised by their out-going effective degrees 9 being equal to 0. Note that 
in a stable configuration x, it is not necessary that all automata of the network 
be stable (i.e., that U(x) = {i £ V | fi(x) ^ Xi] be empty). Indeed, for x to be 
stable, it suffices that T contains no transition that corresponds to the update 
in x of automata belonging to U(x) (this happens in particular when T— Ts). 

8 In this case, Tis the graph of a global transition function F : B n — > B" so any stable 
configuration in this graph is a fixed point of that function. 

9 In any transition graph T, the out-going effective degree of configuration x is the 
number of arcs out-going x that are effective transitions. 
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2.10 Relationships between network features 

In the sequel, we informally call observer of a network "anyone" that has full, 
partial or no knowledge of its characteristics (structure, set of interactions, be- 
haviour. . . ). In this section, we discuss how the different features of the network 
relate and how an observer carrying one type of information on the network may 
derive additional information of another type. 

First, the information carried by the interaction graph (as defined in Sections 2.2 
and 2.3) of a network is contained in the information carried by its set of local 
transition functions. However, it takes exponential time in the size n of the 
network to draw an interaction graph from a set of local transition functions. 
This complexity stands even when the local transition functions are given in 
conjunctive normal form (CNF). Indeed, given j £ V and the CNF definition of 
fi, the problem of determining whether there exists x £ B" such that fi(x) =/= 
fi(x J ) is NP-complete: the CNF version of SAT [52,53] which is NP-complete 
can be reduced to it. 

Next, let us emphasise that with the sole knowledge of the set of local transition 
functions of a network, many different transition graphs may be built. To choose 
one of these graphs as representing the actual network behaviour therefore re- 
quires additional information. This additional information may simply be, for 
instance, the datum specifying that all elementary network transitions are pos- 
sible, or at least that no elementary network transitions are known or considered 
to be impossible. In this case, either the GTG or the GTG cft (which can both be 
built in time Oin ■ 2 2 ™)) need to be chosen. They are the transition graphs that 
represent the alleged network behaviour the most completely. If the network is 
known to perform only asynchronous transitions, then the ATG, the ATG ofi (which 
can both be built in time 0(n-2 n )) or sub-graphs of these are better suited. And 
for networks that are supposed to be updated with a given update schedule 5 of 
period p, either JJ LEM or 7s (which can both be built in time 0{np-2 n )), must be 
considered. Thus, in short, in addition to the knowledge of the set of local tran- 
sition functions, to derive the transition graph of a network, it is necessary (but 
not always sufficient) to specify the nature of the possible transitions or paths 
of the network. Without any such indication, the network behaviour cannot be 
inferred non-ambiguously even from an exhaustive knowledge of its underlying 
mechanisms (i.e., its structure and interactions). 

Conversely, knowing the transition graph of the network is not sufficient to in- 
fer its set of local transition functions and its structure [4,23,30,54]. In some 
cases, however, with some simple additional information, it is [55,56]. Indeed, 
first, suppose that the network behaviour is known to be fully described by a 
deterministic state transition system T= (B n ,T) in which all nodes have out- 
degree at most 1 and all transitions are of the form (x, F(x)) for a certain global 
transition function F : B" — > B™. Then, from Equation 11 below, the set of local 
transition functions T = {fi \ i £ V} of the network can be derived in time 
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0(n 



•2"), i.e., in linear time with respect to the size of T[30]. 



V* € V, fi : | 




(11) 



If Tis known to be a sub-graph of the ATG or the ATG efi , then, also in linear 
time with respect to the size of T(0(n 2 ■ 2™)), F can be built using: 



More generally, under the hypothesis that all transitions in F are elementary, 
the set F of local transition functions of the network can be derived in time 
0(n ■ 2 2n ) by exploiting the following equation: 



In particular, Equation 13 can be used when the network is supposed to be 
updated in parallel. For other block-sequential and simple update schedules, 
Algorithm 1 builds a set of local transition functions F from the dual input 
information of a transition graph Tand an update schedule 5. To do so, it requires 
that Tbe indeed the graph of a function F : B n -> B™ so that T=T$ and F = 
F[5] be possible. In addition, importantly, S must be a simple update schedule. 
The reason for this restriction is that Algorithm 1 (just as an observer would 
need to do) relies on the knowledge that every apparent behaviour has no hidden 

cause. If the transition x » y is observed, then, on the one hand, all automata 

that have not apparently changed states (?/, = Xj) have not effectively done so. 
If they had, they would have had to change an even number of times in order 
to come back eventually into their initial state (x,). This is not possible when 
automata are updated at most once between two observations of the network 
configuration. On the other hand, if the automata that do change states have 
not done so as a result of a series of unobservable causes, then their changes of 
states can indeed be exploited to derive the interactions which are their direct 
causes. 

Let us highlight that block-sequential update schedules have the notable partic- 
ularity of allowing to observe the network configuration only once per period of 
updates without loosing any crucial information about the nature of limit be- 
haviours of the network. It suffices to know the usually non-elementary 10 tran- 
sition graph Ts to draw some significant information. For example, consider a 
network updated with a block-sequential update schedule S = (W / t)tgN/pN of pe- 
riod p. Fs necessarily contains a sub-graph of the following form, that is, a limit 

10 For update schedules 8 such as those that are considered in the present paragraph, 
unless the period of S is 1 as for the parallel update schedule, Fs is non-elementary. 




(12) 




(13) 
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Algorithm 1: From T and 8 to T in time 0(n 2 ■ 2 n ) 
Input: 

— A digraph T= (B n ,T) in which each node has out-degree 1 so that the only 
out-neighbour of any id" can be denoted by F(x) and 

— A simple update schedule 8 = {W t )ten/ P N- 

Output: A set of local transition functions T — {/, : B" — > B | i G V} such 

that T= Ts and F = F[5]. 
forall i 6 1" do 

y <— 

forall i e Wb do 

L fi( X ) < Vi'i 

forall t < p do 

x < — F Wt (x); 
forall i € Wt+i do 

L fi( x ) < — y*; 



behaviour with a certain period k e N: 

» F[()";(:k) » F[(5] 2 (a;) ; 

4^ : 
A : 

F >"]'•' '(.r) « F[S] k - 2 (x) « 1 

Because the transitions involved in this behaviour are not necessarily elemen- 
tary, there might be more than k events needed to loop on configuration x. 
Precisely, there might be between k and p x k. Some of the elementary updates 
required by S along the closed path from x to x may be null so it does not hold 
that non-elementary limit behaviours of period k correspond to elementary limit 
behaviours of period p x k. However, under block-sequential update schedules, 
it does hold that configurations x that are observed to be stable in Ts (i.e., 
configurations that have null effective out-degree and are fixed points of F[S}) 
really are stable: Vi G V, i G U(x). In the general case, on the contrary, the 
update schedule might allow automata to switch states several times between 
two network observations or it might omit the update of an unstable automaton 
so as to give the impression that the network is stable while it is not. Thus, 
unlike with other update schedules, with block-sequential update schedules, an 
observer that has knowledge of Ts can distinguish stable configurations from 
other limit behaviours. 

3 Theorisation and modelling of time 

In this section we concentrate on the "first step" of the modelling process which 
we call the theorisation step. Its purpose is to define the modelling framework 



17 



for the modelling of a certain category of real systems such as systems of genes 
that interact via their protein products. In other words, it aims at setting some 
grounds before any practical modelling of a real system (this next step frames 
Section 4) can effectively be done. Theorisation thus first needs to choose a for- 
mal language in order to describe the features of "reality" that are considered 11 . 
In our context, this language consists in the mathematical language that allows 
to express definitions relative to Boolean automata networks. Next, or simulta- 
neously, in this language, a theory is defined. In consistency with one another, 
definitions are given to formal objects. Properties of these and relationships 
between them are specified. This leads, for instance, to the theory of Boolean 
automata networks as described in Section 2. From the language and theory that 
are chosen follows immediately a correspondence between features of reality in- 
tended to be modelled and features of the theory supposed to model them. This 
reality/theory correspondence is dually composed of a modelling map (which, 
informally represents the reality — > theory direction of the correspondence) and 
an interpretation map (representing the opposite direction, theory — > reality). 
The modelling map specifies how portions of reality are represented mathemat- 
ically. For example, it may specify that genes are modelled by automata, that 
interactions between genes via the proteins they code for are represented by 
Boolean functions, that changes of protein concentrations in the cell are simu- 
lated by transitions of a transition graph. . . Conversely, the interpretation map 
associates a "modelling meaning" or a justification (possibly void) to each object 
and property of the theory. For example, according to this map, automata may 
be interpreted as genes, Boolean vectors may be interpreted as cell configurations 
and a local transition function /i : x i-> ^x A x\ can be regarded as modelling 
the fact that gene G\ remains expressed only if gene Go is not, given that au- 
tomaton i £ {0, 1} models gene Gj. Further, the "interpretation map" can also 
specify that a formal hypothesis stating that synchronous transitions are impos- 
sible (see Hypothesis 1 below) is the formal translation of the fortuitousness of 
two events ending simultaneously. 

The main difficulty of the theorisation step of modelling is the definition of 
both the modelling and the interpretation maps. Indeed, from the deliberately 
designed backbone of the reality/theory correspondence, it often follows some 
complex and subtle ramifications. These ramifications can impose implicitly that 
certain features of reality be matched to some precise features of the theory and 
conversely. But they can also, on the contrary, forbid some matches. Further, 
generally, neither the modelling map nor the interpretation map, are surjective: 
not all aspects of the theory can necessarily be interpreted as the representa- 
tion of something real and, obviously, not all aspects of reality are represented 

11 Let us note that the very act of describing mentally some observations is already in 
itself an automatic and often subconscious theorisation of reality so there obviously 
are some theorisations occuring at a "lower-level" than the level which is the object 
of this section. However, we ignore them here because their identification lies outside 
the scope of our competences (it requires, in particular, to have a good idea of what 
"reality" is precisely). 
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by the theory. As a consequence, the modelling framework itself requires thor- 
ough coherence and bounding in its definition. Modelled features of reality need 
to be identified and distinguished from its non-modelled features. Dually, mod- 
elling parameters and properties of a theory, which can reasonably be considered 
as representations of some portion of observed reality, also need to be identified 
and distinguished from its non-modelling features (those that are artefacts of for- 
malisation rather than pertinent representations of anything real). The present 
section focuses on the notion of time to highlight these difficulties and the co- 
herence that is required between the distinct associations specified or implied by 
the reality /theory correspondence. 

The very concept of transition from one network state x to another y suggests 
a notion of time that positions x before y. The term trajectory that is usually 
used instead of network path (see Section 2.5) re-enforces the natural association 
that can be made between an intuitive idea of time flow and a mathematical 
concept of causal precedence. The length of a series of transitions from x to y 
evokes the time that the system spends to go from one point x to another y in 
its state space. And, especially in a modelling context where the aim precisely 
is to relate experiences of reality and abstract concepts, the formal language 
used tends to adapt to implicit associations that are made to better understand 
theoretical objects and their properties. The question of the pertinence of these 
associations, however, is not always obvious. In the sequel, we first recall some 
definitions on dynamical systems and show how Boolean automata networks 
with a given behaviour can be seen in terms of this formalism. Then, we discuss 
how, in different formalisms, different points of view on the way time is taken 
into account in a model give rise to different problems and questions in its 
theoretical analysis. The section ends with an example that illustrates some of 
these questions and in which Boolean automata networks are seen as models of 
genetic regulation networks. 

3.1 Dynamical systems 

A discrete-time dynamical system, called simply dynamical system in the se- 
quel 12 is a triplet V = (S, 0, <j>) where S is the state space of the system, 0CN 
is its time domain (or evolution space) and 4> : S x — > S is the evolution 
function describing the system dynamics. It satisfies: 

Vs e S, Vii, t 2 € 0, 4>(s, 0) = s and <f>(<f>{s, h),t 2 ) = (/>(s, h + t 2 ). 

4>(s, t) represents the state of the system at time t so that the trajectory (or 
path) of V initiated in state s £ S is {<f)(s,t) \ t £ 0}. Let us note that for any 
initial state s € S, the function <f) s : t h-» </>(s, t) associates to every time step t, 
a unique image (j)(s,t). In particular, it defines the unique successor <f>(s, 1) of s. 

12 We bypass the difficulty of choosing a time space by assuming it is discrete. 
Continuous-time dynamical systems will thus not be mentioned at all. This choice 
is supported by the state space being discrete in our framework. 
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This allows for two types of dynamical systems: deterministic and stochastic. In 
the case of stochastic dynamical systems, however, the terminology introduced 
above needs to be adapted so that S rather denotes a set of probability laws on 
the state space of the system: 

Sc{ M e [o,i] 2 " | J2» x = l}. 

In this case, let x(t) be the random variable corresponding to the system state 
at time t and let fi(t) be the probability law of x(t). Then, 4> is such that /j,(t) = 
0(/x(O),t). 

Deterministic dynamical systems As an example of deterministic dynamical 
systems, let us consider a network updated with a periodic update schedule 
5 = (Wk)ke N/pN- As discussed in Section 2.8, there are several ways to describe 
the network behaviour in this case. First, focusing globally on periods of 5, 
one may consider the transition graph Ts in which every configuration has out- 
degree 1. The system is then context-free and can be defined as a deterministic 
dynamical system T> — (B n ,N, fa) where: 

Vx G B™, Vt G N, fax,t) = F[S\*(x). 

A second way to describe the network behaviour under 5 is to decompose the 
transitions x » F[S](x) into series of elementary transitions 

x — w t F Wt (x), t G N/pN. 

This yields the transition graph 7^ clcm which is not a state transition system 
like Ts but in which every node still has out-degree 1. 7^ clcm defines a context- 
dependent system (whether or not Wq can be updated in configuration x, for 
instance, depends on the previous elementary transition performed by the sys- 
tem) that can be seen as a deterministic dynamical system (B",N, cf)) where: 

MxeW 1 , Mt = k-p + d = d mod p, 4>(x, t) = F Wd o . . . o F Wl o F Wa o F[6] k . 

In this the definition of <j> shows, there is no time-independent (global 

transition) function fa that associates a unique successor fa(x) = fax,!) to 
every configuration x. 

Stochastic dynamical systems When the transition graph T= (B",T) de- 
scribing the behaviour of a network has nodes of out-degree greater than 1 , there 
generally is no obvious way of defining it as a dynamical system. However, with 
some additional indications or hypotheses, probabilities may be assigned to each 
transition of X This way, 7~can be seen as the graph of a Markov chain on B" 
and the adjacency matrix of Tcan be turned into a stochastic transition ma- 
trix, often called the Markovian matrix, i.e., a matrix P of dimension 2™ x 2" 
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satisfying: 

(i) \/x,yeM n ,P Xty e[0,i], 

(it) VxeB", £ weB „ P*,„ = 1 and 

(Hi) Vx, y e B", (x, y)iT^ P x , y = 0. 
The component in this matrix represents the probability that the network 
performs transition (x,y), i.e., the probability that it reaches configuration y 
given that it was in configuration x at the previous time step: 

P XtV =¥(x(t + l)=y | x{t) = x). 

The probability that it changes configurations is given by: 

V(x(t+l)^x(t))= ]T P x ,y. 

Then, the network behaviour can be defined as a stochastic dynamical system 
whose evolution function is given by: 

U(/U,0) = /i, 
\<Kli,t) =H-P\ 

where (i is an arbitrary probability law on the state space of the system (/j G 
[0, l] 2 ™ and J2xex = !)• Thus, if fi = /x(0) is the probability law of the initial 
network configuration x(0) € B™, then /L*(t) = t) is the probability law of the 
network configuration x(t) e B" at time step t (Vz G B n , = W(x{t) = x)). 

Example 8. Consider the network of Examples 1 to 3 and its GTG cft Tg° s = 
(B 3 ,T) and given in Example 7. Then, introducing a rate a e [0,1] at which 
automata of the network are updated (at each time step, any automaton i e V 
is updated with probability a) [57-59], the stochastic transition matrix P can 
defined as follows: 

** ■ (1 - a) Ux ~ d *v if (x,y)eT, 
otherwise, 



V(x,y) eM 3 xM^Px.y = 



where d xy = \D(x,y)\ and u x — \U(x)\ (see Section 2.7 for notations). This 
yields the following transition graph where transitions of Tg° h are labelled by 
their probabilities: 



(0,0,1)- -.—.(1,0,1) (1,1,1) < — ca (0,1,1) 



(0,0,0) as — > (1,0,0) «s — > (1,1,0) < a (0,1,0) 
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3.2 Modelling time 



In the case of networks seen as dynamical systems (deterministic or stochastic), 
because the set is called the time domain of the system and t € is called a 
time step, a moment or a date, an implicit notion of time is introduced that can 
be interpreted several ways. In this section we discuss three different points of 
view that can be taken, in the context of modelling, on the abstractions of time 
arising from the definition of a network behaviour. The first two points of view 
derive from the formalisms of dynamical systems. The last one follows from that 
of state transition systems which are called "causal systems" here to emphasise 
their difference with dynamical systems 13 . 

Modelling durations A first reading of the mathematical concept of time 
defined by dynamical systems consists in interpreting it as a literal match of 
the real time so that time steps in are taken as a unit of measurement of 
real time and all possible network transitions are supposed to take the same 
amount of time, that is, one unit. When the network behaviour is described 
by an elementary transition graph T, for instance, all elementary updates are 
supposed to take the same time, whatever the automata that they update. 

When the system movements modelled by transitions can however not all be 
assumed to take the same fixed amount of time, cannot be interpreted as a 
discretised version of a real time flow. In that case, to maintain a modelling of 
transition durations, the time domain must be backed up with some additional 
parameters. One straightforward method is to label each transition of the net- 
work by a value that measures the time taken by this transition, or rather, by 
the event modelled by this transition. In these lines, in [8, 42, 60-63], the authors 
have refined the formalism of Boolean automata networks in order to model ge- 
netic regulation networks and take into account some of the time delays to which 
regulations are submitted (see Section 3.3). 

In addition to the questions that are mentioned in the next paragraphs which are 
also natural and pertinent with more general approaches, this point of view (like 
any point of view) on time yields a set of theoretical questions that are specific 
to it. These questions rely on the strong hypothesis that the concept of time in 
the definition of a dynamical system is indeed meaningful in terms of modelling. 
As an example, let us cite the following non-exhaustive list: How long does the 
network take or is the network expected to take to reach a certain configuration 
or to start displaying a certain behaviour? What is the (most likely) network 
configuration that is reached in time t?, When or how long will the network 
display this behaviour?, How many times is the network expected to reach this 
configuration during this lapse of time?. . . 

13 Let us note that for similar reasons, that is, to emphasise the difference between the 
two main angles (causal and dynamical) that can be adopted to study Boolean au- 
tomata networks, in Section 2, we have deliberately chosen to use the term behaviour 
of a network rather than the terms dynamical behaviour and dynamics. 
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Modelling precedence A second reading of the mathematical concept of time 
inherent to dynamical systems consists in understanding it as a simple evolu- 
tion parameter defining no more than a relation of precedence between network 
configurations and without implying any notion of duration. If the two tran- 
sitions x » y and x' » y' are both possible, then, with this point of 

view, it becomes coherent to accept that x » y may take much longer to 

happen than x' » y 1 , under certain circumstances, while, perhaps, under 

different circumstances, the opposite is true (i.e., x' » y' takes longer than 

x » y). Thus, different behaviours of the network can take place at different 

time scales although no additional precisions aim at distinguishing these time 
scales or the different possibilities they yield. The mathematical concept of time 
of dynamical systems is regarded as a logical version of time and it requires less 
information on the nature of transitions and on how they happen. Paths or tra- 
jectories simply are sequences of successive events. The time they take cannot 
be measured but the number of events or elementary events they involve can 
however be counted. Consequently, the questions that characterise this point of 
view on Boolean automata networks and dynamical systems are of the follow- 
ing form: How many steps does the network take or is the network expected to 
take to reach a certain configuration or to start displaying a certain behaviour? 
What is (the most likely) network configuration that is reached after k steps?, 
Can a given behaviour be observed after a certain other?, What trajectories or 
behaviours are more likely?. . . 

Modelling causes One last point of view consists in ignoring altogether any 
associations that can be made between an intuitive idea of time (precedence as 
well as duration) and the theoretical features that follow from formalisation. This 
way, contrary to the case where Boolean automata networks are assimilated to 
dynamical systems, no more information than the transition graph is required. 
Any arbitrary Boolean automata network can be regarded as a causal system 
(i.e., a state transition system) stripped from any notion of time. Obviously, there 
is no notion of duration associated to causal systems. But neither is there any 
meaningful notion of time precedence that follows naturally and non ambiguously 
from their definition. Indeed, when several transitions (x, y k ), k <E N, are possible 
in the same network configuration x, then none of the network configurations 
y k is the configuration that is reached after x. Each configuration y k is only 
the result of one of several possible events which may occur with an unknown 
probability and within an unknown lapse of time. The notion of moment is 
therefore replaced by the notion of possibility and duration is replaced by a 
logical relation between causes and consequences. Two transitions (x, y) and 
(x,z) being possible means that x allows at least two different " continuations'' ', 
y and z. Causal systems are non-temporised systems in which the focus is placed 
on the degrees of freedom (i.e., the set of possible state switches) that the network 
and the automata in it have in each configuration. Time-related questions such as 
those that have been mentioned in the previous paragraphs loose their immediate 
meaning. Further, although the problem of how to prune a transition graph in 
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order to make all trajectories deterministic can obviously be pertinent in the case 
of a dynamical system (provided additional information, such as a stochastic 
transition matrix P that specifies what transitions can indeed be ruled out), it 
is not in the more general context of state transition systems. Indeed, in this 
context, transitions are associated to no more information than that of there 
own existence. Thus, no transition of the system that is a priori possible can 
be disregarded a posteriori, even if it represents a highly improbable event. The 
only pertinent questions in the context of state transition systems are "existence 
questions" such as: Can a configuration that satisfies a certain set of properties 
be reached from a given configuration x G B" ?, Is a given behaviour possible?, 
Can this transition be made?, What new behaviours can be reached or become 
possible if some new transitions are added?. . . One example of a problem that fits 
exactly into this framework is studied in [54]. The main question it addresses is 
whether the existence of synchronous transitions in a transition graph increases 
or decreases the possibilities in the network behaviour. 

3.3 Example: modelling time and genetic regulation networks 

In [15, 60, 62], the authors develop a formalism based on Boolean automata net- 
works to model genetic regulation networks. This formalism integrates a formal 
notion of delay to account for the time flow that the regulations are subjected 
to in reality. It also assumes the two following statements: 

Hypothesis 1 Only asynchronous transitions are possible. 

Hypothesis 2 All effective asynchronous transitions that update and activate 
(resp. deactivate) the same automaton take the same amount of time. 

More precisely, each automaton i € V is assigned two values rf^eM and d~ g R, 
called respectively its activation and deactivation delays. They represent the 
time it takes for automaton i to be updated and to change states: df (resp. 
d~) corresponds to the time i takes to switch from to 1 (resp. from 1 to 0). 
By Hypothesis 2, delays do not depend on the context: whatever the network 
configuration x e B™ in which i is unstable (i e U(x)), transition x — « — > x 1 
lasts ti time units where ti = d^ if xi = and ti = if Xi = 1. By Hypothesis 1, 
this covers all possible transitions so, in the rest of this section, we use the 
following notations: 



Example 9. Consider a network of size 2 whose structure is pictured by Figure 1 
and whose set of local transition functions is: 



x — d+ — > x % if Xi = and x — d i — > x % if Xi = 1. 




and 
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Fig. 1. Interaction graph of the network considered in Example 9. 



The ATG ofT of this network, completed with the delay specifications, is given 
below: 



0,1 



(1,0) 
t 

4 



t 
4 



(0,0) _«!+-> (0,1) 



An appreciable consequence of the addition of delays, in a modelling context, and 
one of their main purposes is to allow a natural deterministic interpretation of a 
network behaviour. Indeed, when two transitions are possible in a same config- 
uration, we may suppose that the one that does effectively take place is the one 
that happens faster. Thus, in configuration (0, 0) of the network of Example 9, 
for instance, if d$ < , then transition (0,0) — d+ — > (0, 1) can be supposed 
to be the one to occur. Otherwise, if > d± , transition (0,0) — 4 — > (0,1) 
can be considered as the most likely. 



Interpreting asynchronicity Let us analyse the meaning of Hypothesis 1 with 
respect to the formalism of Boolean automata networks. To do so, let x £ B" be 
a network configuration in which several automata are unstable. In particular, 
let i,j G U(x). By definition, both automata i and j are "on the verge of" 
changing states in x but by Hypothesis 1, they cannot both change states in that 
configuration. In other terms, transitions 



are both possible but transition 



and 



i!4 ,.!-.: 



is not. This apparent conflict between the predefined theory of Boolean automata 
networks (which specifies that both automata can change states and have no rea- 
son not to) and Hypothesis 1 (which disallows them from changing states under 
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some circumstances) can be addressed by additional notions of time and dura- 
tion. The delays implied by Hypothesis 2 and introduced in [15, 60, 62] provide 
these notions precisely. Indeed, labelling transitions by delays augments their 
meaning beyond that of a simple relation of precedence or causality as discussed 
in Section 3.2. And with this additional meaning, rather than stating that i 
and j cannot both change states, Hypothesis 1 can be understood as imposing 
that i and j cannot both change states simultaneously. It becomes significant 

and coherent to consider what happens during a transition x > y, between 

the moment where the network is in configuration x and the moment it reaches 
configuration y. This way, more precisely still, Hypothesis 1 can be taken as the 
claim that no two automata can finish changing states simultaneously. 

With this interpretation of Hypothesis 1, let us suppose that in configuration x, 
i and j are respectively subjected to the delays di G {d~ , d+} and dj G {dj , d^} 
and that: 

di < dj. 

In configuration x, say at time 0, both automata start changing states since, ac- 
cording to the theory, they can. However at time di, automaton i has effectively 
changed states whereas automaton j has not yet had the time to. At this mo- 
ment, only two situations are a priori coherent with both the theory of Boolean 
automata networks and our interpretation of Hypothesis 1: 

1. Either j has become stable (j G Uix 1 )) in which case it must be that j is 
influenced (directly or indirectly) by i (i.e., there is an arc or a path from i 
to j in the network interaction structure) ; 

2. Or j is not influenced (neither indirectly nor directly) by i (and nor is its 
instability) and thus, the change of states of i (effective at the current time 
di) has not affected j which can still change states in x 1 (j G Uix 1 )). 

The second case requires more attention and supplementary justifications. In- 
deed, in this case, Hypothesis 1 imposes that the trajectory: 

di . d i r . .. 

X > x l > x {l ^ 

is possible while the trajectory: 




is not. This means that even though j starts changing states in x, it must stop and 
start all over again in x 1 , making the whole process last di + dj time units rather 
than max{di,dj} — dj. Consequently, the change underwent by i in transition 
x — i — > x 1 which a priori involves i alone has an impact on the possibilities of 
j. And this impact is non-negligible since it concerns the duration of an event 
in a context where durations, precisely, have been given significance. There are 
two ways to preserve the consistency of the fact that i thus influences j in all 
cases and the formal basis of the theory of Boolean automata networks. One 
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can either suppose that two automata cannot be simultaneously unstable unless 
they have a mutual and symmetric influence on one another: 

Vi, j e V, 3x e B™, {i,j} e H(a;) => (i, j) G A and {j, i) e A. 

Or one can rule out simultaneity altogether and take on the following hypothesis: 

Hypothesis 3 No two distinct events can start, finish or occur synchronously. 

Because the first solution is very restrictive, we choose the second and for the rest 
of this section, we accept Hypothesis 3. Let us, however, point out that this snag 
introduced by Hypothesis 1 highlights the difficulty, in a modelling context, of 
justifying restrictions or refinements brought to a theory. In particular, it shows 
that choosing to exclude some elementary transitions in the description of the 
behaviour of a Boolean automata network is a non-trivial choice when the aim 
is not simply theoretical convenience. Indeed, to defend this choice some strong 
arguments are needed (such as those required to justify Hypothesis 3 which is 
needed to support Hypothesis 1, as we have seen above) that can obviously not 
be drawn from the theory itself but must still remain coherent with it. 

Genetic regulation systems and Boolean automata networks with de- 
lays To study how Boolean automata networks with delays model genetic regu- 
lation systems, let us now consider a simple genetic regulation system involving 
exactly n distinct genes Go, ... , G„_i. Very schematically, each gene Gj may un- 
dergo a process by which the information it carries is "decoded" and "read" [64, 
65] to induce the synthesis of one or several proteins. For each gene Gj, let Pi be 
one of these proteins products of Gj 14 . When Gj is being decoded and read, we 
say that it is active. The activation of genes Gi, . . . , G„_i can be modelled by 
n Boolean variables g , . . . , g n -\ so that, Vi € V = {0, . . . , n — 1}, gi = 1 (resp. 
gi = 0) represents the activation (resp. the non-activation) of gene Gj. 

Proteins can regulate the activation of genes. Thus, a gene Gi, via its protein Pi, 
may have a retro-action on the system of genes to which it belongs. For instance, 
it may influence the activation of gene Gj. If it does, this (indirect) regulation 
of Gj by Gi can be represented by an arc (i, j) £ A of an interaction graph 
Q = (V, A) . More precisely, and with respect to other protein influences that can 
act on Gj, it can be represented by a local transition function fi : M n — > B. 

The effect of a regulation modelled by an arc (i,j) e A is however subjected to 
there being enough proteins Pi in the cell. Here, for the sake of simplicity, we 
suppose that each protein Pi becomes able to regulate the activation of any other 
gene if and only if its concentration exceeds a certain threshold. If it does, the 
protein is said to be active. This hypothesis precisely allows a Boolean modelling 
of genetic systems: a protein can either be active with respect to all influences 

14 We consider only one protein for each gene for the sake of simplicity. Considering 
several would be more realistic and would not alter the basis and thrust of our 
argumentation. But it would considerably burden its formulation. 
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it may have on the system, or it may be inactive for all of them (if several levels 
of activation of Pj needed to be considered, it could not be modelled by just two 
states). 

Let us say that gene Gi is expressed when protein Pj is active. And let us highlight 
the difference in our use of the two terms active and expressed concerning genes. 
Gene Gi may be active without being expressed. This usually happens right after 
the precise instant Gi is activated. Before it becomes expressed as well, that is, 
before the precise instant the concentration in Pj hits its activation threshold, a 
time lapse is required during which the concentration in Pj increases but remains 
under its activation threshold. This time lapse, precisely, is modelled by the 
delay . Conversely, Gi may be inactive and expressed. Indeed, following the 
instant Gi is deactivated, it may remain expressed during some time. This time 
corresponds to the time it takes for the concentration in Pi to decrease as a 
result of the molecules degradation and of their not being renewed (since the 
synthesis of Pj is no longer "commanded" by Gi), and fall below its activation 
threshold. It is modelled by the delay d~ . In summary, delays df and d~ both 
represent the time that Pi takes to respond accordingly to a "command" sent by 

More formally, let us introduce n new Boolean variables, x , ■ ■ ■ x n -i, to model 
the expression of each gene Gi, or equivalently, to model the activation of each 
protein Pj: Vi G V, X{ — 1 (resp. Xi — 0) models the expression of Gi (resp. its 
non-expression) and the activation of protein Pj (resp. its non-activation). Then, 
to model the regulation system comprised of genes Gi and their products Pi , let 
us consider a Boolean automata network of size n, whose interaction structure 
and local transition functions are as suggested above and whose automata states 
are the variables Xi, i G V. Thus, a network automaton i G V rather represents 
the protein Pj than its coding gene Gj 15 . Let us consider a situation in which the 
activations of genes Gj are modelled by the vector g = (g , . . . , .g„_i) G B™ and 
the activations of the proteins Pj are represented by the network configuration 
x = (xq, . . . , x n _i) G B™. To model such a situation, we use the following matrix 
which, abusing language, we call the network configuration: 



(14) 



In [15,60,62], only configurations where gi = fi(x) are considered, i.e., Hy- 
pothesis 4 figuring below is made. As a consequence, gi = Xi is equivalent to 
automaton i being stable (i G U(x)) and, Vi G V, gi = Xj is equivalent to the 
configuration being a stable configuration. 

Hypothesis 4 The only possible network configurations have the following form: 



X 




x ■ 




_ 9 _ 




_ 9o ■ 


■ 9n-\ 



x 
9o 



9n-l 



x ... ir n _i 

fo(x) ... f n -l(x) 



15 This is only a convention that we set and which follows naturally from our choices of 
notation. Other presentations that lead to automata corresponding to genes rather 
than to proteins are possible. 
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Example 10. With the notation introduced in Equation 14 and with Hypothe- 
sis 4, the transition graph of Example 9 still has only four configurations, those 
that satisfy g = 1 and gi — ^x V X\\ 



0,1 
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In particular, the following network configurations are supposed to be unrealis- 
able: 



1 1 
1 



and 



1 
1 1 



Boolean automata networks with delays can thus serve as models in which a 
mathematical notion of time flow and duration is introduced to match the time 
that rules over the interactions of the real system they represent. As mentioned 
in the previous paragraphs, it is important to note, however, that the perti- 
nence of the modelling strongly depends on the hypotheses that are added to 
the original theory. More specifically, it depends on the interpretations of these 
hypotheses and on their consistency with respect to prior interpretations of other 
features of the theory. We have already discussed above Hypothesis 1 and how 
its significance relies on the additional notion of time introduced by means of 
delays, and more precisely, how it relies on Hypothesis 3 (unless, as we have 
demonstrated, all network automata are supposed to interact two by two). We 
propose now to analyse further the role of hypotheses and the correspondence 
between this refined version of the theory of Boolean automata networks and 
the genetic regulation systems that they are intended to model. 



A rigorous adaptation and interpretation of Boolean automata net- 
works with delays By Hypothesis 3 (and a fortiori by Hypothesis 1) , we have 
assumed that proteins do not receive commands from their coding genes imme- 
diately when these are emitted. For the precise same reason {i.e., two distinct 
punctual events are necessarily separated in time by a lapse of time, however 
small it be), let us suppose similarly that genes cannot change states simulta- 
neously to the changes of the states of their regulating proteins. Of course, one 
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Fig. 2. Diagram representing possible behaviours of the regulation network of Exam- 
ples 9 and 10 under the assumption that < dj~. Let us emphasise that no effort 
whatsoever has been put into representing the time scale. Consequently, despite the 
space that separates the representation of the dates dg and dg +d /i 5 it is n °t excluded 
that w dg + d /i- 



can argue that the two phenomena evoked are set in different time scales and 
that the second direction involves times lapses that are completely insignificant 
compared to those involved by the first. Nevertheless, the point is to respect Hy- 
pothesis 3 rigorously and rule out simultaneity altogether. Thus, for any couple 
of events that are considered, one event must necessarily occur before the other. 
Therefore, either the change of states of a gene and that of its regulating protein 
must be considered as one and only event, or the changes must be separated in 
time by a delay, possibly very small. We choose the second solution (by identi- 
fying gene state changes and protein state changes, the first solution demands 
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too severe renouncements of the original modelling). Thus, we denote by di/j 
the time it takes between a change of states of protein Pj and the according 
change of states of a gene Gj which is regulated by it. And we do not exclude 
the possibility that the new delays di/j be evidently negligible in comparison to 
the first delays introduced, df and d^ . 

Let us concentrate on the Boolean automata network of Examples 9 and 10 
and let us consider this network as a model of a genetic system with two genes 
Go and G\ and their two protein products P and Pi. Under the three original 
hypotheses 1, 2 and 4, the behaviour of this network is given by the transition 
graphs of Examples 9 and 10. Under the additional but necessary Hypothesis 3, 
no network configurations (as defined by Equation 14) are excluded any longer. 
This implies in particular that Hypothesis 3 (and a fortiori Hypothesis 1) and 
Hypothesis 4 cannot strictly be satisfied at once. And since transitions still are 
asynchronous, in theory, in our example network of size two, four elementary 
transitions must be considered in every configuration (some of which may be 
null): 



5o xi 
5o 5i 



<f— 



x x x 
fo(x) 9i 
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X\ 
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xo 51 
5o 5i 



x x\ 
5o h{x) 



Rather than giving the complete new version of the transition graph of this net- 
work (which contains 16 configurations), let us suppose that d$ < d x and let us 
consider the diagram pictured in Figure 2. It describes informally four alternate 
scenarios that can occur under the current general Hypotheses 1, 2 and 3. All 
four of these scenarios start with the activation of Pi in the network configu- 
ration modelling the situation in which both genes are active but unexpressed. 
The system is supposed to be isolated and free of any exterior perturbations so 
that failures in protein concentrations (resp. in the expression of genes) due to 
other factors than the deactivation of their coding genes (resp. the inhibition of 
their regulating proteins) are ignored. In particular, cases in which go = are 
disregarded. Let us comment on this new modelling of the system behaviour. 
First, several "kinds" of configurations 

1 1 
1 1 

are involved. There are those in which the signal emitted by protein Pi addressed 
to its coding gene G\ has not (yet) been received and there are those in which 
it has. Figure 2 suggests that in the latter configurations, the activation of G\ 
is more stable than in the former in which the activation of G\ is re-enforced 
by the presence of an activating protein (rather than just being made possible 
by the absence of an inhibiting one) . Second, besides their respective complexity 
and the transient trajectories they involve, Figure 2 and the transition graph 
of Example 10 yield similar results if one focuses on limit behaviours. These 
remarks may encourage one in arguing that Figure 2 represents a futile com- 
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plexification of the original modelling. However, let us recall that, importantly, 
this new modelling follows from a rigorous adaptation of the original theory of 
Boolean automata networks with delays and that it was made necessary to make 
the theory fit coherently to its priorly intended interpretation. Moreover, in this 
context where Hypotheses 1, 2 and 3 all need to be respected, the possible du- 
ality in the meaning of network configurations that are formally considered as 
one unique configuration, as well as the existence of different intermediary steps 
leading to the limit stable configurations are important. Both these features of 
the modelling cannot be disregarded a posteriori since they result directly and 
essentially from the work of theorisation and interpretation that was made a pri- 
ori. In particular, if the network was actually a sub-network of a larger network, 
then some other automata may be affected by the states of the two automata 
that are considered. As a consequence, the behaviour of the whole network could 
be considerably affected by slight deviations from what is predicted by the orig- 
inal transition graph. These deviations could, for instance, consist in gene G\ 
being less stable in some configurations where x = x\ = go = gi than in others. 
Or they could be the intermediary steps through which the network may pass 
even if only to remain a fleeting instant. Since our new formalism is a rigorous 
refinement and adaptation of the original, the pertinence of the modelling it pro- 
duces cannot be questioned unless that of the original version is. It can, however, 
be criticised for its increased complexity all the more so that Figure 2 proposes 
an incomplete description of the network behaviour. Indeed, in Scenario 3, not 
all signals emitted are received. Protein Pi is active during a certain period, i.e., 
its concentration in the cell exceeds the threshold for some time and yet, its 
coding gene G\ ignores it. For the sake of formal rigour and consistency, the de- 
scription should be completed with a sequel to this branch which would increase 
further the complexity of the description. Further, to interprete this new formal 
description and to gauge the feasibility and plausibility of any of the scenarios it 
mentions, relationships of precedence need to be established between the events 
that are considered. Consequently, delays (and perhaps protein concentrations) 
need to be compared precisely. But this leads to the conclusion that to exploit 
the new formalism, it must be refined further still, suggesting in the end that a 
continuous framework would be better suited for the modelling problems at hand 
here. Thus, an eventual change of modelling paradigm seems to be required. 

Boolean automata networks with delays thus provide indeed a modelling of some 
real phenomena that takes time flow into account. The supplementary parame- 
ters that are integrated in these models apparently provide an enhancement of 
the original model that potentially allows to better understand the conditions re- 
quired for one scenario to occur rather than another according to the delays that 
separate the events involved in each. But the modelling power of this formalism 
relies substantially on the non trivial interpretation of some formal hypotheses, 
in particular, the absolute non-simultaneity of any (lasting) couple of events and 
the assumption that whatever the current state of the cell, a protein always takes 
the same amount of time to change states. 
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4 Effective modelling and observing of a system 

Once a formal well-bounded framework has been accepted and coherently as- 
sociated to a correspondence with reality, the observing and modelling of the 
behaviour of a particular system can be carried out. The present section is set 
in this general context. More specifically it lies on the hypothesis that neither 
the machinery of a system nor its movements can ever be effectively observed. 
Instead, it is the successive punctual positions or states of the system that can 
be. As an illustration, on the 9 th of august 2011 at local time 12 : 30 : 56 in the 
place with geographic coordinates (45.662587,1.789201), we may observe that 
proteins Pi and Pi are both present in very high concentrations in the liver cells 
of the rat Bobby. Two minutes later, we may note that the concentration of P\ 
is almost null. Further, we may perform the same experience at different dates, 
times, places and with different rats, and make similar observations whereas a 
second experience in which P 2 is initially absent always yields different obser- 
vations: the concentrations of P\ that are recorded always are approximately 
identical during several hours. In both experiences, no dynamical phenomcnons 
such as concentration changes are observed, only their results are. In addition, 
the influence of protein P 2 on the concentration of protein P x is not observed 
effectively although several situations are observed from which this influence (or 
perhaps a different conclusion) may be inferred. Now, primarily, the very nature 
of modelling makes it impossible for any model to even come close to reproduc- 
ing or describing a real system faithfully: from the start, the theorisation process 
can only aim at designing a framework that hopefully will allow models to re- 
produce, describe or explain partially some properties of the system. Here, we 
add the hypothesis that the only available information concerning a real system 
is derived from necessarily partial observations of its behaviour. In summary, in 
our context, the effective modelling of a real system is supposed to start with a 
series of observations of its behaviour. Once a "reasonable" collection of obser- 
vations has been established, it becomes possible to build a model of the system 
behaviour. To derive from it a model of the system structure and underlying 
mechanisms, however, is a tricky task that necessarily encounters sources of in- 
completion which call for supplementary approximations and hypotheses. And 
these new approximations and hypotheses have notable significance with regards 
to the reality/theory correspondence. 

4.1 From observations of a system to a model of its underlying 
interactions 

The very first step in the process of effective modelling consists in determining a 
set of formal configurations to model the states in which the system is observed. 
Here, we focus on systems that contain a finite number of interacting elements 
which are intended to be modelled by Boolean automata networks so the formal 
configurations that are chosen are Boolean vectors x € B" where each Boolean 
coefficient Xi e B of x is supposed to represent the state of a system element 
identified by the label i < n. The choice of the set of configurations is crucial 
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because it requires a knowledge of what is the "interior" of the system that is 
modelled and what is its "exterior". Indeed, to specify the size of the model 
network and decide what is the dimension n of the Boolean vectors, the number 
of elements interacting in the system needs to be known. This often means that 
the set of elements itself needs to be known. Therefore, in most cases, from 
the very start, modelling requires to assume or to know that the content and 
the frontiers of the system to be modelled are exactly those that are suggested 
by or derived from observations of it. In other terms, unless certainty may be 
established on this subject, the following hypothesis is needed: 

Hypothesis 5 The number n e N of interacting automata in the network is 
known. 

Then, the movements or changes of the system can be considered for modelling. 
Let us suppose that x € B" is a configuration that models the system state 
which has been observed right before the system state modelled by configuration 
y e B™. Then, the change underwent by the system between these two observed 
states is modelled by the couple (x, y) € B™ xl". Such couples are called observed 
transitions and denoted as follows: 

X rr^r^y. 

It is important to note that observed transitions are not necessarily network 
transitions. This will become clear in the sequel. The set of all observed transi- 
tions derived from observations of a system defines an observed transition graph 
whose role is precisely to model the system behaviour. 

With an observed transition graph Tbs in hand, the modelling can be pushed 
a step further. The aim is now to infer from Tbs knowledge concerning the un- 
derlying mechanisms that contribute in producing the events that are described 
in Tobs- In short, this third step of modelling aims at determining the causes of 
the effects observed. Thus, the information carried by Tobs must be analysed and 
exploited in order to define a set of local transition functions T and a network 
structure Q which must both be coherent with Tobs, with the hypotheses that 
were made beforehand, with the underlying theory, and with its correspondence 
with reality, pre-defined by the prior work of theorisation. A natural way to do 
this is to derive F using one of Equations 11, 12 and 13 or to exploit Algorithm 1 
and then derive Q from T. However, as mentioned in Section 2.10, this method 
requires some additional information and failing which, some additional hypoth- 
esis concerning the nature of the transitions in Tbs- As a consequence, if T and 
Q are defined this way and if they are effectively intended to be used, studied 
or exploited to derive additional information on the original system, then the 
following question needs to be answered: how can T and Q be interpreted, i.e., 
what modelling power can they be granted? In the absence of a complete and 
satisfiable answer to this question some new hypotheses need to be made. 

The rest of this section proposes to illustrate some of these hypotheses and dis- 
cuss their meaning and impact. In the examples that are taken, for the sake 



34 



of their simplicity, Boolean automata networks serve as models of Boolean au- 
tomata networks. In other terms, the networks play both the parts of observed 
system to be modelled and model of the observed system. One might imagine 
that a computer program simulates the behaviour of a Boolean automata net- 
work N and prints out on a monitor the states it takes. Of course, when the 
program is executed, on the one hand, the program may be designed to hide 
some parts of the information and on the other hand, our diligence in watch- 
ing the screen may not be perfect. We may, for instance, only look periodically 
at it, or from time to time at random, or we may perhaps forget to look at it 
altogether. In any case, the lists of successive configurations that are observed 
yield a transition graph Tobs from which, as mentioned above, sets of automata, 
of interactions and of local transition functions may be inferred to define a new 
Boolean automata network N' supposed to model the original network N. 

Example 11. Let us first illustrate the importance of Hypothesis 5. To do so, 
let us suppose that the observed transition graph Tobs contains only the two 
following transitions: 

(1,0) tt^ (1,1) and (0,0) o~o-»(0,l). 

By Hypothesis 5, we can start by deriving that the set of configurations of the 
observed network N is B 2 and that it involves two automata which are referred to 
here as automaton and automaton 1. Choosing to model these two automata by 
two automata with the same names and using Equations 13 or 12 (i.e., assuming 
transitions of Tobs are elementary) yields the following local transition functions: 

f : x e B 2 ^ x Q and f[ : x E B 2 ^ 1. 

The resulting model N' of N has the same behaviour as N (its set of possible 
transitions is exactly those of Tobs)- It has T — {/q, /(} as set of local transition 
functions and the digraph pictured below as interaction structure: 

Q) © 

If Hypothesis 5 were not satisfied, however, N could contain a third automaton, 
automaton 2, that observations of N give no evidence of. Then, when transition 
(1,0) t-y-^ (1,1) is observed, it could be that what is really occurring is a series 
of transitions that has the same effect of increasing the state of automaton 1 but 
involve the hidden automaton 2: 

(1,0,0) » (1,0,1) (1,1,1), 

or perhaps just: 

(1,0,1) — ^> (1,1,1). 

In that case, according to whether automaton activates automaton 2 or not, or 

according to whether (1, 0, 0) » (1, 0, 1) is possible or not, the local transition 

function of automaton 2 could equal: 

/ 2 :ieB 3 HH or f 2 : x e B 3 ^ 0. 
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That of automata and 1 could respectively equal: 

/o : x e B 3 i y x Q and f 1 : x e B 3 i-» x : V x 2 . 

As a consequence, rather than the interaction graph of TV' that suggests that 
automata and 1 are independent and depend only on themselves, network N 
could instead have one of the interaction graphs pictured in Figure 3. 58 




b. 



Fig. 3. Possible interaction graphs of Boolean automata network whose transition 
graph Tobs presented in Example 11. 

Example 1 1 shows that relying on Hypothesis 5 in the construction of a model 
N' of a system TV may possibly be responsible for the inferring of a structure that 
differs significantly from the reality of N. This way, Hypothesis 5 may cause to 
mistaken the cause of an event that is observed (the change of state of automaton 
1 in Example 11, for instance). Now, let us consider some new hypotheses that 
apply more specifically to the transitions of T h s - 

First, Hypothesis 6 below allows to focus effectively on T bs & n d on any infor- 
mation (such as Q and J 7 ) that can be drawn solely from it as suggested above. 
Indeed, it assumes that no exterior force or reason is responsible even partially 
for an observation that is made: 

Hypothesis 6 Every behaviour of the system is caused by factors that are con- 
tained in its own definition. 

Example 12. Let us suppose that the observed Boolean automata network N 
satisfies the following properties. It contains two automata, automaton and 
automaton 1 whose local transition functions are defined by: 

V.t € B 2 , fo(x) = x\ and /i(ar)=a;o. 

Its interaction graph is the digraph pictured on the left of Figure 4 and, for 
some reason, N can only perform asynchronous transitions and it can potentially 
perform them all. Its behaviour, in the absence of any exterior perturbations is 
described by the ATG eff on the right of Figure 4. 

Now, let us suppose that N is subjected to some exterior forces that impose re- 
strictions on its behaviour and that these restrictions can be translated in terms 
of updating constraints. For instance, it could be that active automata tend to be 
updated before inactive automata. In that case, transitions (0,1) — o — > (1,1) 
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(1,0) i_>(l, l)^ 

Fig. 4. Interaction graph and ATG cfT of the Boolean automata network of Example 12. 

and (1,0) — i — > (1,1) in the ATG ofT of Figure 4 may be considered as unlikely 
or impossible. Here, let us suppose differently that an exterior force causes au- 
tomaton to be much faster in switching states than automaton 1 is. Thus, N 
behaves as if it were obeying to the sequential update schedule 5 = {0}{1}. An 
outside observer that is aware that N only performs asynchronous transitions 
might possibly ignore that N is submitted to any updating constraints and as a 
consequence might record the following transition graph T bs'- 



(0,1) 
o 

(1,1)-) 
\_0,1 

If this were the case, the model N' of N would have local transition functions 
defined by: 

Vrr GB 2 , f' (x) = f[{x) = Xl 
and the following structure: 




The influence that automaton has on automaton 1 would thus not be revealed 
at all because of the precedence of 0-updates over 1-updates. X 

The example above highlights the non-triviality of choosing to ignore the possi- 
bility that the network behaviour might be subjected to exterior influences that 
cannot be formalised as proper automata influences. In this example, focus on 
the particular but important case of influences that can be taken into account 
through the defining of an update schedule. 



O' 1 \ 

e (o,o) 

o 



(1,0) 
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The next hypothesis specifies that every system behaviour that should have been 
observed, with respect to the "observation protocol", has indeed been observed 
and modelled in T bs- 

Hypothesis 7 With respect to the nature of observed transitions, Tobs contains 
all paths and behaviours of the system model. 

Thus, if the system is supposed to have been observed often enough so as to 
witness each of its least changes then, by Hypothesis 7, Tobs is the GTG or the 
GTG off of its model. If, on the contrary, the system is not supposed to have 
been observed often enough to acknowledge every elementary transition, then, 
Hypothesis 7 imposes that Tobs still contain a "representation" of any possible 

system behaviour. If transition x » y is a (possibly elementary) transition 

that models a system change of states, then, by Hypothesis 7, it must be taken 
into account: the path that involves this transition must be represented in Tobs, 
even if not elementarily. In some sense, Hypothesis 7 is a stronger version of 
Hypothesis 6 that assumes that all behaviours that an observer has no knowledge 
of are impossible under all circumstances. Conversely, it means that any change 
that is not observed is not possible, i.e., any transition that does not belong 
to %bs either is a contained in a path of Tobs or is impossible altogether. As 
a consequence, Hypothesis 7 implies Hypothesis 8 below which relates (if not 
equates) the exercise that consists in observing movement to that of observing 
fixity. 

Hypothesis 8 A configuration with out-degree null in Tobs models a stable con- 
figuration of the system. 

Example 13. Let us suppose that only one transition of the Boolean automata 
network N has been observed: 



Using Equation 12, we derive a model network N' of size 2 with the following- 
local transition functions: 



Supposing that Hypothesis 7 is actually not satisfied because, for instance, tran- 
sition (0, 1) » (1)1) nas been missed, it could be that the local transition 

function of automaton in N is in reality /o : x n- xq V x\ rather than function 
/o- And the structure of N rather looks like: 



(1,0) 



(1,1)- 




and 




The interaction structure of N' is: 
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than like the structure of the model N'. Missing some observations leads here to 
miss some interactions. Conversely, missing some observations can lead to infer 
interactions that do not exist. Indeed, this happens if network N can perform 

transition (0, 0) » (0, 1) although it has not been observed. If this is the case, 

the local transition function of automaton 1 in N does not equal /{ either but, 
instead, the constant function /i : x 1 and the structure of N is as follows: 



c 4 





Thus, in reality, automaton 1 of N could be submitted to no influence from 
automaton contrary to what is suggested by model N' . Further, combining 
this latter example to Example 11, we find that it could also be that N has a 
structure that looks like that of Figure 3 a. (this could happen if automaton 2 
was not observed and transition (0,0) » (0, 1) was missed). 38 

The examples above highlight how failing to observe some of the possible move- 
ments of the system may hinder the correct reconstruction of its set of interac- 
tions by either leading to the inference of non-existing interactions or missing 
some existing ones. 

In some cases, to exploit Tobs, one may hypothesise on the granularity of events 
as in the last two hypotheses we mention here. 

Hypothesis 9 Tobs is an elementary transition graph. 

Example 14- Let Tobs be the following digraph: 

(0,0)^^(1,1) (0,1) d^ (1 ' 0) 

Assuming all transitions in Tobs are elementary and using Equation 13, a model 
N' may be derived with the following local transition functions: 

/o : x e B 2 i-» -nx and /J : i £ B 2 4 ->X\. 

and the structure pictured below: 



Q 9 



According to this model, the two automata of N do not interact. However, let 
us suppose that Tobs has been derived by observing the configuration of N only 
once per time unit, while, in reality, the network TV performs two transitions 
per time unit. Contrary to Hypothesis 9, Tobs is not elementary. Further, let us 
suppose that the synchronous transitions that are observed in Tobs are actually 
due to a series of two asynchronous transitions so that the ATG ofI of N is: 

(0,0) i— > (0,1) 

o o 
(1,0) <-i (1,1) 
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and its structure is the digraph below: 




Example 14 shows that assuming that observed transitions are elementary is 
not a negligible choice either, in terms of modelling. In the same lines, the next 
example shows that this is true even when transitions seem to be asynchronous. 

Example 15. Let N be of size 3 and let us suppose that its behaviour has been 
observed starting in each of its 8 different configurations. As a result of this 
experimentation the following list of possible transitions has been obtained: 

(0,1,0)— ■* (0,0,0) (1,1,0) (1,0,0) 



(1,0,1) orww (1,1,1) (0,0,1) onnrv^ (0,1,1) 

Assuming, by Hypothesis 9 that all four of these transitions are elementary (and 
asynchronous), a model N' can be derived with the following local transition 
functions: 

f' :xE B 3 i y .t , f[ : x e B 3 x x V x 2 and f 2 : x e B 3 i-> x 2 . 
and the following interaction graph: 




This modelling of N by N' suggests that the only interaction existing between 
two different automata of the network consists in the activation of automaton 
1 by automaton 2. Supposing again that Hypothesis 9 is not satisfied, it might 
be that N is actually updated with the update schedule S = {0, 1}, {1, 2}, {0, 2} 
of period 3. In that case, the four apparently asynchronous transitions that are 
observed are actually rough approximations of the four sequences of transitions 
that figure below. As before, this loss of information may be explained for in- 
stance by the fact that only one observation per time unit was made while N 
performed 3 transitions per time unit. 



{0,1} * \ {0,1} {1,2} {0,2} 

(0, 1, 0) > (0, 0, 0) J (1, 1, 0) > (0, 1, 0) > (0, 0, 1) (1, 0, 0) 

{0,1} ^\ {0,1} {1,2} {0,2} K 

(1,0,1) > (1,1,1) (0,0,1) ^-^> (1,0,1) > (1,1,0) —^(0,1,1) 
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It can be checked that in this case, the local transition functions of the three 
automata of N, rather than being equal to the functions f[ of the model N', 
could equal the following 

V.x G B 3 , fo{x)=x 2 , fi(x)=x , f 2 {x)=x 1 , 

so that the interaction structure of N would be a circuit: 




Thus, as demonstrated by Example 15, even an apparently asynchronous tran- 
sition can in reality be a series of synchronous transitions. Let us note that 
generally, when Tobs equals a transition graph of the form Ts where S is a block- 
sequential update schedule, then, Equation 11 yields a model N' in which, infor- 
mally, all the information of N has been "condensed" due to the application of 
5 and to the transitivity of sequences of interactions [23,33,37]. As mentioned 
in Section 2.10, however, the condensed version of the information cannot be 
unravelled unless 5 is known precisely (see Algorithm 1). 

Finally, when, contrary to the previous paragraph, Tobs is not supposed to be 
elementary (for instance, because a specific update schedule is assumed) then, 
the information on the system carried by %bs is incomplete unless the series of 
system changes that occur between two successive observations of the system 
can be reconstructed. 

Hypothesis 10 The additional information required to break down each non- 
elementary transition of Tobs into an elementary path is known. 

Hypothesis 10 resembles Hypothesis 9 but is stronger. While Hypothesis 9 as- 
sumes that there is nothing more to each transition than what is observed, 
Hypothesis 10 assumes knowledge of what more there is to the transitions that 
are observed and considered non-elementary. 

Example 16. Let Tobs be the following observed transition graph: 

^^^^^ ^ 

^(1,0)^°^ 

where the bottom two transitions are supposed to be elementary and the remain- 
ing transition (0,0) o^->(l,l) is supposed to be an approximation of the ele- 
mentary path (0, 0) -nnn_> (1, 0) o"r^-> (1, 1). Such hypotheses might for instance 
be made naturally if N is considered unable of synchronicity and if transition 

(0, 1) » (1, 1) is supposed to be impossible because it has not been observed. 

This, with Hypothesis 7 and Equation 12, yields a model N' which has the 
following local transition functions: 

/g : x e B 2 i-» x V -ix i , /|:j:el 2 4ioVxi 



41 



and the following interaction graph: 




Suppose now that Hypothesis 10 is wrongly assumed and all transitions observed 
are not necessarily asynchronous. TV is able to perform synchronous transitions 
and in particular, it can perform transition (0,0) — {04} — ► (1,1) in one step. 
Then, it could be that the local transition function of automaton 1 is actually: 

fi ■ x h-> 1 

and its interaction structure is: 




Thus, wrongly assuming Hypothesis 10 can also lead to deriving the existence 
of interactions that do not take place in reality. 38 

To make up for the lack of information, modelling needs to put forward and rely 
on some supplementary hypotheses such as Hypotheses 5 to 10. This proves the 
importance of hypotheses. On the one hand, they allow modelling to happen. On 
the other hand, as we have endeavoured to show with the examples of this section, 
they can mislead it. In particular, they can cause the interaction structure of a 
system and of its model to differ significantly, mainly due to the fact that even 
in very simple instances of Boolean automata networks, there might be different 
possible causes to a same effect. 



4.2 Defining networks 

Relying on the previous sections, we now can propose the following formal defi- 
nition of Boolean automata networks: 

Definition 1. A Boolean automata network N of size n G N is defined by a set 
of n local transition functions: 

N = T = {fi I < i < n). 



Let us note that by Section 2.10, from a set T of local transition functions, the 
GTG of the network defined by J 7 , can be computed. Conversely, given the GTG 
of the network, T can also be inferred. As a consequence, a Boolean automata 
network is given equivalently to Definition 1 by its GTG. 

Definition 1 vs. the interaction structure Let us first recall that with this 
definition, given a Boolean automata network N = J- of size n, the interaction 
structure of TV can be inferred in time 0{2 n ) (see Section 2.10). Now, comparing 
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with a possible definition of Boolean automata networks by their interaction 
structures, we find that, contrary to such a definition, Definition 1 implies a first 
implicit hypothesis: 

Hypothesis 11 An automaton either obeys all of its influences at once (when 
it is updated), or it obeys none. 

In other words, by our choice of definition, we assume that any updated automa- 
ton i e V can under no circumstances ignore some of its influences (j, i) e A. 
However, let us note that this restriction can be partially resolved by a judicious 
design of the local transition functions /». For instance, if originally, fi(x) = Xj, 
then i can be made independent of j when j is inactive by replacing fi by 
x i y Xj V Xi. This way of making i independent of j can nevertheless only 
be done relatively to a given network configuration or set of network configu- 
rations 16 . This leads us to the following remark. Definition 1 implies another 
non-trivial implicit hypothesis: 

Hypothesis 12 The result of the interactions that take place in a given config- 
uration depends only on that configuration. 

Thus, by Definition 1, we assume that given a network interaction structure 
Q = (V, A), the nature of the influence represented by the arc (j, i) e A, 
when this influence is effective, depends only on the current network configu- 
ration. Consider a network of size 2, for instance, where automaton 1 depends 
on automaton and the local transition function affected to automaton 1 is 
/i : x € B™ i-)- xq € B. Then, in configuration x = (1,0) whatever the cur- 
rent environment of the network, if the influence of automaton on automaton 
1 does take place (i.e., if automaton 1 is updated), then automaton causes 
automaton 1 to take state 1. Definition 1 and more specifically Hypothesis 12 
disallow any situation in which the active automaton deactivates automaton 
1 in configuration x. 

In spite of the disputable limitations imposed by Hypotheses 11 and 12, we still 
prefer Definition 1 to a less restrictive definition of Boolean automata networks 
by their interaction structures. The reason is mainly arbitrary but motivated, 
however, by the belief that the nature of each interaction that can potentially 
take place in a network (rather than just their existence) is an essential feature 
of a network. 

Definition 1 vs. the transition graph Similarly we have chosen not to de- 
fine a Boolean automata network by a transition graph (other than its GTG). 

16 More generally, let us suppose that fi(x) is given in CNF. For any Boolean formula 
4>(x) in CNF, let 4>(x)[0/Xj] equal the formula 4>{x) in which every occurrences of 
both the literal Xj and its negation -^Xj are replaced by (e.g., if <j)(x) — (x\ V -^X2 V 
^:r 4 ) A (x 2 V £3), then <f>(x)[0/x2] = (xi V ^£4) A x s ). Then, automaton i can be 
made to ignore the influence (j, i) £ A in configuration y € B" if fi is replaced by 
the function x h-> fi(x) V fi(y)[0/x*]. 
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In short, the reason for this is to avoid restricting a network to a system that 
has only one behaviour. Indeed, if, contrary to Definition 1, a Boolean automata 
network N is defined by a transition graph T(which is not the GTG of N), then, 
N can only behave according to T- Of course, this definition may indeed appear 
better suited for a modelling context involving steps similar to those described in 
Section 4.1 (which start with the formalisation of an observed system behaviour 
in terms of a transition graph 7). However, it is important to note again that 
in such a context, it is the transitions of T rather than the interactions of N 
that are initially designed to model a portion of reality. All the examples of Sec- 
tion 4.1 show that this difference is significant. The pertinence of the modelling 
embodied by these transitions is contingent on the circumstances in which the 
real system is observed as well as on how it is observed. In addition, as devel- 
oped in Section 4.1, the interactions that take place in the network TV are only 
inferred theoretically based on some non-trivial hypotheses such as Hypotheses 5 
to 10. The meaning, in terms of modelling, of the interactions derived this way 
is thus subjected to the relevance of the hypotheses that are made. However, 
identifying the observed transition graph with the model network N' itself fails 
at putting forward clearly these hypotheses which are instead relegated into a 
pre-accepted definition. As a consequence, Hypotheses 5 to 10, in particular, are 
made implicitly and systematically without being questioned. 

Definition 1, on the contrary, either makes irrelevant Hypotheses 5 to 10, or it 
requires that they be made explicit. Indeed, since Definition 1 is equivalent to 
defining a network by its most general behaviour, i.e., its GTG, any sub-graph of 
this transition graph can be considered as a possible elementary behaviour of the 
network. Further, the same is true for any other graph whose arcs can be broken 
down into paths that belong to the GTG. As a consequence, Boolean automata 
networks defined by Definition 1 may be considered as models of systems that 
possibly behave differently in different environments. Hypothesis 6, in particular, 
is made superfluous. More generally given a network N = T whose interactions 
are known a priori according to Definition 1 , in any transition graph representing 
the behaviour of N, all transitions have non-ambiguous causes deriving directly 
from the network definition itself. Hypotheses 5 to 10 are irrelevant in this case. 
And in a converse situation similar to those discussed in Section 4.1 in which 
the interactions of a network are to be inferred from a partial description of 
its behaviour, Definition 1 makes Hypotheses 5 to 10 obviously needed so they 
cannot be ignored. The intrinsic "incompleteness" of Definition 1 (incompleteness 
in the sense that Definition 1 does not specify the network behaviour precisely) 
imposes that any supplementary theoretical argument that is used to give a 
ruling on the network behaviour be either justified or put forward as purely 
hypothetical and convenient. 

5 Discussion 

Modelling generally involves an intermediary step that has not been mentioned 
in this paper. This step, purely theoretical, is set between theorisation and ef- 
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fective modelling. It consists in manipulating objects of the theory alone and 
making series of logical inferences to prove or simulate new theoretical proper- 
ties that will hopefully be interesting for the rest of the modelling process. Its 
main difficulty is to make proper use of the rules of classical logic commonly used 
in mathematics. Theorisation and effective modelling, on the contrary, are ubiq- 
uitous steps that require to go back and forth between reality and theory and 
that are concerned by any change brought to the theory or to the set of modelled 
characteristics of the real system. Generally, to compare reality with theory and 
to elaborate theory on the basis of reality makes indispensable a thorough and 
coherent definition and bounding of the modelling process itself. For this reason, 
theorisation and effective modelling involve a considerable difficulty of which the 
intermediary purely formal step is immune. This difficulty is to navigate safely 
but constructively between experiences and observations of reality on one side, 
and mathematical abstractions of it on the other. 

Sections 3 and 4 have both emphasised how the safeguard of consistency through- 
out the modelling process is indeed an issue. In the respective contexts of the- 
orisation and effective modelling, the intrinsic and dual incompleteness of any 
knowledge as well as of any representation of reality needs to be overcome. As 
we have endeavoured to highlight in this paper, hypothesising is a required but 
tricky solution to this. Consider Hypothesis 1, for instance. Section 3.3 argues 
that no pre-existing interpretation of the theory of Boolean automata networks 
can serve or yield a valid interpretation for this hypothesis. Generally, an ex- 
planation of a formal choice cannot be drawn solely from the theory itself (the 
theory cannot justify itself). But it cannot either rely exclusively on knowledge 
or observations of the real system or category of real systems being modelled. 
Indeed, arguments drawn from such knowledge and observations need to be con- 
sistent with the pre-defined reality /theory correspondence. Thus, to justify why, 
under Hypothesis 1 , certain possible elementary transitions are ignored and more 
precisely why the network is supposed not to follow the "maximum speed gradi- 
ent", a new feature of reality must be put forward as argued in Section 3.3. First, 
a strong supplementary notion of time must be taken into account. Second, in 
addition to this, since the fortuitousness of two events finishing simultaneously is 
not enough (see Section 3.3) to justify Hypothesis 1 non-ambiguously, one must 
also consider an absolute notion of non-simultaneity that forbids any two events 
to overlap in time. The significance of this can be seen in particular by notic- 
ing that the modelling meaning of the local interaction graphs Q{x), x £ B™, is 
quite limited under these assumptions that are intended to support Hypothe- 
sis 1. Indeed, except for their existing at the same instant, the interactions in 
these digraphs are otherwise completely independent. And this absolute non- 
simultaneity must then be implanted rigorously into the pre-defined theory and 
be connected consistently with the pre-existing reality/theory correspondence. 
The refinement represented by Hypothesis 1 thus obviously calls for another 
refinement which is the time notion brought by delays. But, as the end of Sec- 
tion 3.3 shows, the consistency requirement also calls for further refinements. 
And, in turn, so does the need for non-ambiguous and plausible predictions. 
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And each refinement adds complexity to the modelling which increases the dif- 
ficulty in the interpretation of the theory. This looming necessity for series of 
levellings of the approximations that are made by theorisation runs the risk of 
ultimately aiming towards a different modelling paradigm (in the case analysed 
in Section 3.3, the series seems to aim towards a differential formalism) which 
would necessarily disavow the present modelling paradigm by disavowing some 
of its founding hypotheses (a continuous formalism obviously denies the founding- 
hypotheses of a discrete modelling such as a Boolean modelling). Thus, model 
refining, i.e., the augmenting of the modelling and interpretation maps associ- 
ated to a theory (see Page 18) as well as augmenting the theory itself is limited 
on one hand by the necessity to maintain consistency, and on the other hand 
by the risk of increasing the complexity of the modelling and its interpretation 
to an extent where a different model is required. The existence of these limits 
favours the two following ideas. 

First, the ins and outs of a model (the theory and the reality/theory correspon- 
dence) must be understood and fathomed so that no unnecessary refinement 
be added. Thus, in a context where duration is not intended to be modelled 
specifically, one could exploit the original definition of transitions of Boolean 
automata networks to model a "duration-free" version of delays, bypassing the 
thorny problem of simultaneity. Indeed, all elementary transitions that are possi- 
ble in an arbitrary configuration x € B" (i.e., all arcs outgoing x in the GTG) may 
simply be seen as representing the possibility that in x one or several automata 
may end up ulteriorly having changed states. If i,j £ U(x) are two automata 
that can both change states in x, then the synchronous elementary transition 
x — {i,j} — ► x^'ri does not necessarily mean that the change of states of i and 
the change of states of j have finished simultaneously. Actually, it cannot mean 
this if time is not modelled (the notion of "simultaneity", in particular, is obvi- 
ously based on that of time). With the most general and basic interpretation of 
the theory of Boolean automata networks, synchronous transitions can only be 
interpreted, "at the best", as models of changes that have been produced during 
overlapping periods of time: a transition x — — ► x^- 1 '^ can be taken to 
mean that neither x 1 nor x^ are visited long enough by the network (if visited at 
all) for there to occur, in these configurations, a decisive event that could divert 
the network from its path from x to x^- 1 '^ . 

The second idea that derives from the evidence of the limits of refining a model 
concerns the intrinsic incompleteness of modelling and its benefit. Levelling the 
incompleteness of a model to make it apparently better fit reality can make it 
hard to bound properly the properties that are supposed to be modelled and 
embrace the approximations and hypotheses that were made in order to do so. 
Yet, modelling draws its pertinence precisely from its incompleteness. It is based 
on approximations of reality that can be very rough, perhaps even incorrect, and 
on the formal hypotheses to which they translate (such as the discreteness of 
events and time). Whether these hypotheses be supported by reasonable argu- 
ments or not does not necessarily impact directly on the pertinence of a model. 
Indeed, one of the main purposes of hypotheses is to be questioned and eventu- 
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ally discarded. Convincing model predictions do not produce an explanation of 
reality in themselves. An analysis of the hypotheses on which is based the model, 
however, may. On one hand, convincing model predictions suggest that the hy- 
potheses that were chosen are plausible so they can be used in the elaboration 
of an explanation (even if only an unrefined explanation) of the reality observed. 
On the other hand, the most immediate cause of a poor simulation of reality 
by a model can often be found in its underlying hypotheses which thus gain in 
being known. In addition, in more abstract lines, hypotheses are also important 
because they allow simplifications that may help formal developments. Further- 
more, their very presence carries questions that can help orient the development 
and build a mathematical understanding of the theory that is considered. Thus, 
provided they be clearly expressed, hypotheses are precisely the model features 
that can be questioned and consequently that can allow a constructive compari- 
son between reality and abstraction. In that, they represent one of the essential 
aspects of modelling, even (and perhaps especially) when they lack convincing 
modelling meaning. 

Thus, hypotheses are, we believe, the principal primary foundations of modelling. 
They embody formally the sources of inaccuracy, incompleteness and possibly 
incorrectness on which the model is developed and on which relies the pertinence 
of its predictions. A set of ill-identified approximations represents a risk that the 
series of interpretations that are drawn from a model may slightly drift in a way 
that we have no knowledge of. It may thus be responsible for an orientation of 
our understanding of reality that eludes our control. In addition, more simply 
perhaps, by nature, hypotheses carry implicit assertions of facts that are not 
questioned unless the hypotheses themselves are. Because we have no guarantee 
that a question is not pertinent until it arises and is either answered or discarded 
deliberately, to avoid implicit answers, we believe that however subtle or reason- 
able they may be, hypotheses should be clearly explicited as well as rigorously 
analysed, so far as possible. And in particular, as argued in Section 4.2, special 
attention should be brought to the design of the theory and to the acknowledge- 
ment of the hypotheses that follow implicitly from its definition choices. 

In summary, hypotheses not only are unavoidable embodiments of the impossi- 
bility of modelling a system (rather than a well-bounded set of its properties), 
they also are essential. They allow the approximations and simplifications of 
reality without which no modelling would be possible. And since they enclose 
their intrinsic incompleteness, they allow models to be effectively informative. 
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